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INHIBITION OF GENE TRANSCRIPTION BY POLY AMIDE DNA-BINDING LIGANDS 



5 

The U.S. Government has certain rights in this invention pursuant to Grant Nos. GM 
26453, 27681, 47530 and AI 29182 awarded by the National Institute of Health. 

10 CROSS REFERENCE TO RELATED APPLICATIONS 

This apphcation is a continuation-in-part of U.S. Provisional Applications S,N, 
60/038,384, filed February 14, 1997, S.N. 60/038,394, filed February 14, 1997, S.N. 60/[CIT 
2683], filed September 2, 1997, and S.N. 60/ICIT 2684], filed September 10, 1997, which are 
incorporated by reference, U.S. application 08/853,022, filed April 21, 1997, and 
15 PCT/US97/12722, filed July 21, 1997. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

20 This invention relates to polyamides that bind to predetermined sequences in the minor 

groove of double stranded DNA that are useful for diagnosis and treatment of diseases 
associated with gene transcription. This invention is related to modulation of cellular or viral 
gene expression required for maintenance and replication of pathogens in infectious disease, 
such as HIV-1 and CMV. This invention is also related to modulation of cellular gene 

25 expression in non-infectious disease conditions, such as cancers involving oncogenes, e,g., her- 
2/neu. 

Gene Therapy Approaches for HIV: 

30 Considerable effort has been expended over the past decade to devise methods to 

interfere with HIV-1 gene expression in living cells in the hope that therapeutic strategies will 
come from these studies (recently reviewed in Kohn, D.B. and N. Sarver, Gene therapy for 
HIV'l infection^ in Antiviral Chemotherapy, J. Mills, P.A. Volberding, and L. Corey, Editors. 
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1996, Plenum Press: New York. p. 421-427.). One approach includes interference with the 
translation of messenger RNA into protein by the mtroduction of antisense oligonucleotides 
into lymphoid cells, as discussed in Kohn, D.B. and N. Sarver, Gene therapy for HIV-l 
infection, in Antiviral Chemotherapy, J. Mills, P.A. Volberding, and L. Corey, Editors. 1996, 
5 Plenum Press: New York. p. 421-427; Bordier, B., et al. Proa Natl Acad. ScL USA,, 92: 
9383-9387 (1995) and Lisziewicz, J., et al , Proa Natl Acad. Sci, USA, 91: 7942-7946 
(1994). 

Another approach involves ribozyme-mediated destruction of specific regions of HIV-l 
10 mRNA. See Sun, L.Q., et al. Proa Natl Acad. Scl USA,, 92: Ull-ine (1995); Yamada, 
O., et al , 1 Virol, 70: 1596-1601 (1996) and Zhou, C, et al , Gene, 149: 33-39 (1994). 
Decoy molecules, corresponding to HIV-l RNA domains that bind regulatory proteins required 
for the HTV-l life cycle (TAR RNA which binds Tat or the Rev-response element) have been 
used as inhibitors of HIV-l replication (Sullenger, B.A., et al , CeU, 63: 601-608 (1990). In 
15 addition, /ran^-dominant mutant versions of these regulatory proteins, introduced into cells 
with retroviral expression vectors, have been shown to inhibit HIV-l rephcation (Bevec, D., et 
al , Proa Natl Acad ScL U S A„ 89: 9870-9874, 1992.). 

Other approaches for direct inhibition of gene transcription, including designed or 
20 selected zinc finger peptides that recognize pre-determined DNA sequences, are described in 
Wu, H., et al.. Proa Natl Acad. Scl USA., 92: 344-348 (1995) and Thiesen, H.-J., Gene 
Expr., 5: 229-243 (1996). DNA-cleaving ribozymes have also been tried (Raillard, S.A. and 
G.F. Joyce, Biochemistry, 35: 11693-11701(1996)). Triple helix-forming oligonucleotides 
have been used to block HTV-l integration: Bouziane, M., et al., J. Biol Chem., Ill: 10359- 
25 1 0364 (1996). Triple helix-forming oligonucleotides have also been used specifically cleave 
HIV-l DNA with a metalloporphyrin group attached to the oligonucleotide, as described by 
Bigey, P., G. Pratviel, and B. Meunier, Nucleic Acids Res., 23: 3894-3900 (1995). 
Additionally, the DNA-binding caHcheamicin oUgosaccharides have the potential for use in 
anti-HIV-1 therapy but have not as yet been applied to this disease. See Ho, S.N., et al. Proa 
30 Natl Acad ScL, 91: 9203-9307 (1994) and Liu, C, et al, Biochemistry, 93: 940-944 (1996). 

For any gene therapy approach to be successful, several criteria must be met by the 
therapeutic agent: First, the agent must not possess any general cell toxicity and should not 
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elicit an immune response. Second, the agent must be cell-pemieable or amenable to viral 
delivery and, in the case of the DNA-binding agents, the therapeutic agent must transit to the 
nucleus and bind the target sequence with high affinity and specificity in the context of cellular 
chromatin. Third, binding of the agent to its DNA or RNA target sequence must interfere with 
5 gene transcription or protein translation. 

Each of the potential approaches listed above has its own unique advantages and 
limitations. For example, while nucleic acid-based approaches (antisense, decoy and triple 
helix-forming oligonucleotides and ribozymes) have the potential for sequence selectivity and 

10 can effectively inhibit transcription or translation in vitro, these molecules suffer fix)m poor cell 
permeability and other delivery systems, such as retroviral vectors in the case of the ribozymes 
(Zhou, C, et al, 1994) or liposomes or other delivery strategies in the case of antisense or 
triple helix oligonucleotides, must be used for effective gene inhibition (reviewed in Kohn & 
Sarver, 1996). Similarly, zinc finger peptides must be introduced via a gene therapy approach 

15 with an appropriate viral expression vector since these peptides cannot directly enter cells. See 
Choo, Y., et al , Nature, 372: 642-645 (1994). 

One additional problem with gene therapy approaches is that they must be performed 
on lymphoid cells ex vivo and, once an "HTV-protected" cell population is established, these 
20 cells must then be introduced into the patient. 

In contrast to gene therapy approaches, HIV protease inhibitors taken in combination 
with standard anti-retroviral agents (AZT) have recently shown success in clinical trials. Wei, 
X.etal, Nature, 117-122(1995); Ho, DD. era/., iVa^ure, 373: 123-126(1995). 

25 

The key to tlie anti-HIV properties of these drugs is that they strike at two separate 
phases of the virus life cycle, limitmg the ability of spontaneous mutations to result in 
inhibitor-resistant strains of the virus. Small molecule inhibitors of HTV-l RNA transcription 
which would target a third phase of the virus life cycle would be highly desirable. Cell- 
30 permeable sequence-specific DNA-binding ligands would circumvent the problems associated 
with other forms of gene therapy and could compliment the protease inhibitor-anti-retroviral 
agent cocktail approach mentioned above. The calicheamicin oUgosaccharides satisfy some of 
the requirements for a therapeutic agent; these molecules are sufficiently hydrophobic to pass 
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through cell membranes but these molecules possess severely limited sequence specificity (4 
base pairs) and bind DNA with very low affinities (100 ^iM or higher required for inhibition of 
protein-DNA interactions . See Ho, S.N., et al, Proc, Natl Acad. ScL, 91: 9203-9307 (1994) 
and Liu, C, et al, Biochemistry, 93: 940-944 (1996)). 

5 

Thus, new classes of cell-permeable molecules that possess higher degrees of DNA 
sequence specificity and affinity are needed for the treatment of AIDS and other infectious 
diseases. We describe below the successful development of a new class of highly specific 
designed small molecule ligands with great potential for inhibition of HTV-l gene transcription. 

10 

The HIV-1 Enhancer and Promoter: 



A recent review has summarized our current knowledge of the protein factors required 
for the control of RNA initiation and elongation by RNA polymerase II at the HIV-1 promoter 

15 (Jones, K.A. and B.M. Peterlin. 1994. Control of RNA initiation and elongation at the HIV-1 
promoter. Annu, Rev, Biochem., 63: 717-743). Thus only those aspects of HIV-l transcription 
that relate to transcription inhibition are discussed herein. For HTV, the template for synthesis 
of both new viral RNA and messenger RNA (for viral protein synthesis) is the integrated 
provirus, the product of reverse transcription of the viral RNA in the infected cell. HIV-1 

20 utilizes the transcription machinery of the host cell but encodes its own /ra«j-activators, Tat 
and Rev, that are responsible for RNA elongation and utilization. The HIV-1 promoter is 
located in the U3 region of the leftward (5*) long terminal repeat (see Figure 11 below, taken 
firom Jones and Peterlin, 1994). The core promoter and enhancer elements span a region of 
approximately 250 base pairs and include TATA and initiator elements and the binding sites 

25 for the following cellular transcription factors: Spl, NF-kB, LEF-1, Ets-1 and USF. 
Sequences upstream of the NF-kB sites contribute only marginally to HIV-1 promoter activity 
either in vitro or in transfected cell lymphoid cell Unes. Waterman, M.L. and K.A. Jones, New 
Biologist, 2: 621-636 (1990). However, these upstream sequences, and presumably the protein 
factors which bind these upstream sequences, are important for viral replication, and hence 

30 promoter activity, in peripheral blood lymphocytes and in some T cell hues. Kim, J., et 
a/,. 7. Virol 67: 1658-1662 (1993). 
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Two of the binding sites in the upstream region correspond to recognition sites for 
activator proteins that are lymphoid cell specific (or highly enriched in T cells) and are shared 
with the promoter of the T cell receptor (TCRa) gene: these are the Ets-1 and LEF-1 
transcription factors. The essential role of the upstream region has recently been reproduced in 
5 vitro with a chromatin reconstitution assay (Sheridan, P.L., et ai, Genes Dev., 9: 2090-2104 
(1995)). 

Packaging of the HIV-1 promoter into nucleosomes strongly repressed transcription, 
but this repression could be relieved by pre-incubation of the template with the HIV-1 

10 enhancer-binding proteins, LEF-1 and Ets-1. LEF-1 and Ets-1 thus apparently act in concert to 
prevent nucleosome-mediated repression in vivo. Inhibition of formation of this complex by 
small molecule inhibitors could well represent a viable target for HIV-1 gene therapy. LEF-1 
is a member of the HMG family of proteins and binds DNA as a monomer. DNA binding is in 
the minor groove and results in a large distortion of the DNA helix (unwinding and bending) 

15 (Love, J.J., et al. Nature, 376: 791-795 (1995)). 

In addition to acting as an architectural transcription factor, LEF-1 possesses a strong 
frfltK^-activation domain which can function when artificiality transferred to other DNA- 
binding proteins (Giese, K., et al. Genes Dev„ 9: 995-1008 (1995)). This region of the HIV-1 
20 enhancer might thus prove to be an effective target for inhibition of viral transcription and 
hence virus replication. 

The HIV-1 promoter also contains tandem binding sites for NF-kB, a factor that is 
strongly induced by HIV infection (Bachelerie, F., et al Nature, 350: 709-712 (1991)) and 

25 multiple binding sites for the general transcription factor Spl. The mechanisms of NF-kB 
activation have been reviewed by Jones and Peterlin, 1994. Important for this discussion, NF- 
kB has been shown to contact both Spl and the TBP subunit of the basal transcription factor 
TFIID. Perkins, N.D., et al, Mol Cell Biol, 14: 6570-6583 (1994). Additionally, Spl has 
been shown to interact with the TAFllO subunit of TFIID (llOkDa TBP-associated factor) 

30 (Chen, J.L., et al, CeU, 79: 93-105, 1994). It is the binding of TFHD via the TBP interaction 
with the TATA element that nucleates the assembly of the complete RNA polymerase II 
transcription complex (reviewed in Maldonado, E. and D. Reinberg, Current Opinion in Cell 
Biology, 7: 352-361, 1995). Thus, NF-kB may function through recruitment of Spl and TFIID 
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to the HIV-1 promo tor via these protein-protein interactions. Thus blocking the NF-kB-DNA 
and/or Spl-DNA interaction is another potential target for HIV therapy. Since these factors, 
and especially Spl and TFIID, are utilized by a wide range of cellular genes, the binding sites 
for these factors would not be good targets for HIV-specific inhibition (or any gene-specific 
5 inhibition). However, the sequences adjacent to these sites, that are unique to HIV-1 proviral 
DNA, are excellent candidate targets for the design of inhibitory DNA ligands (see below). 



Polyamide DNA-binding Ligands: 

Simple rules have been developed to rationally determine the sequence-specificity of 

10 minor-groove-binding polyamides containing 7V-methylimidazole and iV-methylpyrrole amino 
acids. These synthetic pyrrole-imidazole polyamides bind DNA with excellent specificity and 
very high affinities, even exceeding the affinities of many sequence-specific transcription 
factors (Trauger et al,. Nature, 382: 559-561, 1996). An Im/Py pair distinguishes G*C fcom 
C*G and both of these fi-om A*T or T»A base pairs. Wade, W.S., et al. describes the design of 

15 peptides that bind in the minor groove of DNA at 5'-WGWCW-3' sequences (where W is 
either A or T, and a W»W pairs is an A*T or a T»A base pairs by a dimeric side-by-side motif 
1 Am, Chem, Soc, 114, 8783-8794 (1992); Mrksich, M. et al describes antiparallel side-by- 
side motif for sequence specific-recognition in the minor groove of DNA by the designed 
peptide l-methylimidazole-2-carboxamidenetropsin. Proa, Natl Acad. Sou USA 89, 7586- 

20 7590 (1992); Trauger, J.W., et al., describes the recognition of DNA by designed ligands at 
subnanomolar concentiations. Nature 382, 559-561 (1996). A Py/Py pair specifies A*T fi-om 
G*C but does not distinguish A»T fi-om T*A. Pelton, J.G. & Wemmer, D.E. describes the 
structural characterization of a 2-1 distamycin A-d(CGCAAATTTGGC) complex by two- 
dimensional NMR, Proa Natl Acad Set USA 86, 5723-5727 (1989); White, S., et al. 

25 Biochemistry 35, 12532-12537 (1996) describes the effects of the A»TyT*A degeneracy of 
pyrrole-imidazole polyamide recognition in the minor groove of DNA, the pairing rules for 
recognition in tlie minor groove of DNA by pyrrole-imidazole polyamides, and also describes 
the 5 '-3' N-C orientation preference for polyamide binding in the minor groove. 

30 It has been found that a new aromatic amino acid, 3-hydroxy-N-methylpyrrole (Hp) 

when incorporated into a polyamide and paired opposite Py, provides the means to discriminate 
A^^ from T«A. White, S., et al. Nature 391 436-438 (1998). Unexpectedly, the replacement 
of a single hydrogen atom on tlie pyrrole with a hydroxy group in a Hp/Py pair regulates the 
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affinity and the specificity of a polyamide by an order of magnitude. Utilizing Hp together 
with Py and Im in polyamides to form four aromatic amino acid pairs (Im/Py, Py/Im, Hp/Py, 
and Py/Hp) provides a code to distinguish all four Watson-Crick base pairs in the minor groove 
ofDNA. 

5 

The prefeired corresponding designed specific polyamides resulting from this invention 
are of the form 

X1X2 . . . Xm-Y-X(m + 1) . . . X(2m-l)X2m-p-Dp 

10 wherein X], X2, Xm, X(ni +1), X(2m - 1)> and X2m are carboxamide residues 

forming carboxamide binding pairs Xi/X2m, X2/X(2ni-1), Xm/X(m + 1), and y is y- 
aminobuytic acid or 2,4 diaminobutyric acid and Dp is dimethylaminopropylamide, 
and where 

carboxamide binding pair X\fX2m corresponds to base pair Ni»N'i, 
15 carboxamide binding pair X2/X(2ni-1) corresponds to base pair N2«N'2, 

carboxamide binding pair Xm/X(m+1) corresponds to base pair Nm^N'm- 

In general, tlie specific polyamide DNA-binding ligands were designed by using a 
method that comprises the steps of identifying the target DNA sequence 5'-WNiN2 
20 NmW-3'; representing the identified sequence as 5'-Wfl6 . . . jcW-3', wherein a is a first 
nucleotide to be bound by the Xi carboxamide residue, 6 is a second nucleotide to be bound by 
the X2 carboxamide residue, and x is the corresponding nucleotide to be bound by the Xm 
carboxamide residue; defining a as A, G, C, or T to correspond to the first nucleotide to be 
bound by a carboxamide residue in the identified six base pair sequence. 

25 

Carboxamide residues were selected sequentially as follows: Im was selected as the Xi 
carboxamide residue and Py as tile X2m carboxamide residue if a was G. Py was selected as 
the Xi carboxamide residue and Im as the X2m carboxamide residue if a was C. Hp was 
selected as the Xi carboxamide residue and Py as the X2m carboxamide residue if a was T. 
30 Py was selected as the Xi carboxamide residue and Hp as the X2m carboxamide residue if a 
was A. 
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The remaining carboxamide residues were selected in the same fashion. Im was 
selected as the X2 carboxamide residue and Py as the X2m-1 carboxamide residue if b was G. 
Py was selected as tlie X2 carboxamide residue and Im as the X2m-1 carboxamide residue if b 
5 was C. Hp was selected as tlie X2 carboxamide residue and Py as the X2m-1 carboxamide 
residue if b was T. Py was selected as the X2 carboxamide residue and Hp as the X2m-1 
carboxamide residue if b was A. 

The selection of carboxamide residues was continued through m iterations. In the last 
10 iteration, Im was selected as tlie Xm carboxamide residue and Py as the Xm+i carboxamide 
residue if x was G. Py was selected as the Xm carboxamide residue and Im as the Xm+l 
carboxamide residue if x was C. Hp was selected as the Xm carboxamide residue and Py as 
the Xm+1 carboxamide residue if x was T. Py was selected as the Xm carboxamide residue 
and Hp as the Xm+1 carboxamide residue if jc was A. 

15 

In one prefen*ed embodiment, the polyamide includes at least four consecutive 
carboxamide pairs for binding to at least four base pairs in a duplex DNA sequence. In another 
preferred embodiment, tlie polyamide includes at least five consecutive carboxamide pairs for 
binding to at least five base pairs in a duplex DNA sequence. In yet another preferred 
20 embodiment, the polyamide includes at least six consecutive carboxamide pairs for binding to 
at least six base pairs in a duplex DNA sequence. In one preferred embodiment, the improved 
polyamides have four carboxamide binding pairs that will distinguish A*T, T*A, C*G and G«C 
base pairs in the minor groove of a duplex DNA sequeiice. 

25 DNA target sequence recognition thus depends on a code of side-by-carboxamide 

residue pairings in tlie minor groove of double stranded DNA. These compounds represent the 
only class of synthetic small molecules that can bind predetermined DNA sequences with 
affinities and specificities comparable to DNA-binding proteins. 

30 
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Summarv Of The Invention 

This invention provides specific polyamides that are useful for modulating the 
expression of a cellular or viral gene by binding to predetermined target sequences adjacent to 
5 the binding site of a transcription factor protein in the minor groove of double stranded DNA. 
Suitable cellular genes include both eukaiyotic and prokaryotic genes. The cellular gene can 
be present in the original native cells, in cells transfected or transformed with a recombinant 
DNA construct comprising the cellular gene or in an in vitro in a cell-free system. The viral 
gene can be present in a cell or in an in vitro in a cell-free system. 

10 

The polyamides of the present invention can act as specific inhibitors of gene 
transcription in vivo or in vitro as therapeutic agents in disease conditions related to the 
transcription of at least one cellular or viral gene. Studies with three accepted model systems 
have shown that polyamides do interfere with the binding of sequence-specific minor groove 
15 transcription factor proteins as well as with components of the basal transcription machinery 
and thus block transcription of target genes. 

Hereinafter, N-metliylpyrrole carboxamide may be referred to as "Py", N- 
methylimidazole carboxamide may be referred to as "Im", 3-hydroxy-N-methylpyrrole 
20 carboxamide may be referred to as 'Hp", y-aminobutyric acid may referred to as 'V, p-alanine 
may be referred to as **P", glycine may be referred to as "G", dimethylaminopropylamide may 
be referred to as "Dp'\ and ethylenediaminetetraacetic acid may be referred to as "EDTA". 

The invention encompasses polyamides having y-aminobutyric acid or a substituted y- 
25 aminobutyric acid to fomi a hairpin with a member of each carboxamide pairing on each side 
of it. Preferably the substituted y-aminobutyric acid is a chiral substituted y-aminobutyric acid 
such as (R)-2,4-diaminobutyric acid. In addition, the polyamides may contain an aliphatic 
amino acid residue, preferably a P-alanme residue, in place of a Hp or Py carboxamide. The P- 
alanine residue is represented in formulas as p. The p-alanine residue becomes a member of a 
30 carboxamide binding pair. The invention further includes the substitution as a p/p binding pair 
for non-hn containing binding pair. Thus, binding pairs in addition to the Im/Py, Py/Im, Hp/Py 
and Py/Hp are Im/p, p/Im, Py/p, p/Py, Hp/p, p/Hp, and p/p. 



WO9835702 [ file:/A\cadmrfs01\rirmdataVlp\FolevPat\PatentDQCuments\WO9835702.cp cl 



WO 98/35702 



PCr/US9g/02444 



Page 12 of 113 



- 10- 

The polyamides of tlie invention can have additional moieties attached covalently to the 
polyamide. Preferably tlie additional moieties are attached as substituents at the amino 
terminus of the polyamide, the carboxy terminus of the polyamide, or at a chiral (R)-2,4- 
diaminobutyric acid residue. Suitable additional moieties include a detectable labeling group 
5 such as a dye, biotin or a hapten. Otlier suitable additional moieties are DNA reactive moieties 
that provide for sequence specific cleavage of the duplex DNA. 

A central aspect of the present invention is the use of imique or rare sequences adjacent 
to the binding sites for common transcription factors as the target sequences for the design of 

10 polyamides. It has been found that (1) sequences adjacent to the binding sites for required 
transcription factors are unique to the genes under study and are not found in other genes in the 
current nucleic acid data bases; (2) polyamides targeted to these sequences are effective 
inhibitors of protein-DNA interactions; (3) such polyamides are inhibitors of transcription 
factor-dependent gene transcription in vitro\ and (4) the polyamides are cell permeable agents 

15 and have been shown to inhibit transcription of target genes in human cells in culture. 

Most importantly, several designed polyamides have been shown to inhibit 
transcription of specific genes in vivo and thus these compounds must be both cell permeable 
and once inside tiie cell they must be able to transit the nuclear envelope and bind their target 
20 sites within chromatin (Gottesfeld, J.M., et al, Nature, 387: 202-205, 1997). These results 
demonstrate that designed pynole-imidazole polyamides are useful in the treatment of diseases, 
particularly viral diseases, including AIDS, as well as many other diseases for which specific 
candidate gene targets have been identified. 

25 The present invention provides specific polyamides which inhibit the transcription of 

DNA upstream or downstream of transcriptional factors such as the 5 S RNA gene 
transcriptional factor TFIEA, tlie minor groove-binding protein TATA-box binding protein 
(TBP), Ets-1 and llie lymphoid enhancer factor LEF-1 protein. These polyamides act as gene- 
specific inhibitors of transcription since these polyamides are selective for the sequences 

30 flanking these protein binding sites that are, in turn, gene-specific. The polyamides are useful 
as therapeutics for the treatment of cancer as well as for the treatment of diseases caused by 
vimses and other pathogens (such as bacterial, fungal, etc.) 



WO9835702 [file:/A\cadmffs01\firmdata\]p\FoievPat\PatentDocuments\WO9835702.cpcl 



-WO 98/35702 



PCT/US98/02444 



Page 13 of 113 



- 11 - 

The present invention provides a composition comprising a transcription inhibiting 
amount of at least one polyamide chosen from the group consisting of ImPyPyPy-y-ImPyPy-P- 
Dp, ImPy-p-hnPy-y-ImPy-p-IniPy-P-Dp and mixtures thereof and a pharmaceutically 
acceptable excipient suitable for the treatment of HTV-l infection. The invention also provides 

5 a method of treating a human patient with an HTV-l infection comprising the step of 
administering a composition comprising a transcription inhibiting amount of at least one 
polyamide chosen from tlie group consisting of hnPyPyPy-y-ImPyPy-p-Dp, ImPy-P-ImPy-y- 
ImPy-P-ImPy-P-Dp and mixtures thereof and a pharmaceutically acceptable excipient. 
Preferably a transcription inliibiting amount corresponds to an extracellular concentration of 

10 polyamide of about 100 nanomolar to about 10 micromolar. In one preferred embodiment, a 
transcription inhibiting amount corresponds to an extracellular concentration of about one 
micromolar to about ten micromolar LnPyPyPy-y-ImPyPy-P-Dp mixed with about one 
micromolar to about ten micromolar IniPy-P-ImPy-y-ImPy-P-IniPy-P-Dp, 

15 The present invention provides methods of treating cells in vitro as well as treating a 

human patient or a non-human organism in vivo. In one preferred embodiment, the invention 
provides a method of treating HIV-1 infected human blood cells in vitro comprising the step of 
administering a composition comprising a transcription inhibiting amount of at least one 
polyamide chosen from tlie group consisting of ImPyPyPy-y-ImPyPy-p-E>p, ImPy-P-ImPy-y- 

20 ImPy-P-LnPy-P-Dp and mixtures thereof. 

In other embodiments, tlie invention provides a diagnostic kit for detecting the 
identified target DNA sequence by employing the selective polyamides and a suitable system 
for the detection of the polyamide bound to the DNA. 

25 

Brief Description oFthe Drawings 

Figure 1 is a representation of a schematic model for the interaction of the nine zmc 
finger protein TFIIIA witli tlie 5S ribosomal RNA gene internal control region (ICR). 

30 

Figure 2 is a representation of the results of DNase I footprinting analysis of the 
binding of polyamide 1 and TFIIIA to tlie 5S RNA gene ICR. 



WO9835702 f file:/A\cadmrfsQ1\firmdata\ip\FolevPat\PatentDocuments\WO9835702.c pc1 



WO 98/35702 



PCT/US98/02444 



Page 14 of 113 



-12- 

Figure 3 is a representation of the results of experiments demonstrating inhibition of 5S 
RNA gene transcription in vitro. 

Figure 4 is a representation of the results of an experiment demonstrating the inhibition 
5 of 5S RNA gene transcription complex formation in vivo. 

Figure 5 is a schematic representation of the HTV-l genome that encodes Gag, Pol, 
Env and six regulatory proteins as well as anticipated formally matched complexes of 1 and 4, 
and formally mismatched complexes of 2, 3, 5, and 6 with the HTV-l target sequence 5*- 
10 TGCTGCA-3'. 

Figure 6 is a schematic representation of the structures of 5-y-5 polyamides 1-3, and 2- 
P-2-y-2-p-2 polyamides 4-6. 

15 Figure 7 is a schematic representation of part of the sequence of the restriction fragment 

used in quantitative DNase I footprinting experiments. The three 7 base pair target sites 5'- 
TGCTGCA-3' (the HIV-1 target site), 5'-TGGTGGA-3', and 5'-TGTTACA-3' are 
highlighted. 

20 Figure 8 is a representation of the results of a footprint experiment demonstrating the 

binding of polyamides 4-6 and 3. 

Figure 9 is a representation of the results of a MPE*Fe"^ footprint experiment using 
polyamides 4, 5, 6, and 3 at 1 nM. 

25 Figure 10 is a representation of the results of a MPE*Fe^ footprint experiment showing 

binding of a seven base pair sequence by polyamides 4, 5, 6, and 3. 

Figure 1 1 is a representation of the sequence of the HIV- 1 enhancer / promoter element 
showing the binding sites for the cellular transcription factors Ets-1, LEF-1, NF-kB and SPl 
30 along with a canonical TBP binding site, or "TATA element." 
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Figure 12 is a representation of the results of an experiment demonstrating inhibition of 
LEF-1 binding to tlie HTV-l enhancer by polyamide L 

Figure 1 3 is a representation of the results of an experiment demonstrating binding to 
5 wild type and mutant forms of the HIV-1 enhancer by polyamide 1. 

Figure 14 is a representation of the results of an DNase footprint experiment 
demonstrating binding to the HIV-1 enhancer of the polyamide ImPy-p-ImPy-y-ImPy-p-ImPy- 
P-Dp,also called 

10 

Figure 15 is a representation of the inhibition of HTV-l transcription by the polyamide 
ImPy-P-ImPy-Y-ImPy-(J-ImPy-p-Dp, also called HIV-1, 

Figure 1 6 is a schematic model of the binding of the polyamides IraPy-P-ImPy-y-ImPy- 
15 p-ImPy-p-Dp, also called and Imlm-P-Imlm-y-PyPy-p-PyPy-P-Dp, also caUed HIV'2 

to the DNA sequences adjacent to the HIV-1 TATA box. 

Figure 17 is a schematic model of the binding of polyamides designed for the 
recognition of TFIID (la - Ic) and Ets-l/LEF-1 (2a-2b) binding sites. 

20 

Figure 18 is a representation of the HTV-l promoter and DNA-binding sites for 
polyamide hairpins designed to target the HIV- 1 promoter. 

Figure 19 is a representation of the results of a footprint experiment comparing the 
25 binding of polyamides and TBP. 

Figure 20 is a representation of the results of a footprint experiment comparing the 
binding of polyamides and LEF-1. 

30 Figure 21 is a representation of the results of a footprint experiment comparing the 

binding of polyamides, HIV-1 and CMV. 
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Figure 22 is a graphical representation of the results of a experimental treatment of 
human white blood cells infected with HIV. 

Figure 23 is a graphical representation of the results of a experimental treatment of 
5 human white blood cells infected with HIV. 

Figure 24 is a graphical representation of the results of a experiment demonstrating the 
specificity of polyamides. 

10 Figure 25 is a schematic representation of polyamide binding to the DNA sequence 

adjacent to the Ets-1 site. 

Figure 26 is a representation of the results of a footprint experiment comparing the 
binding of polyamides and AN331 to the HIV-1 promoter region. 

15 

Figure 27 is a representation of the results of experiments showing the specific Ets-1 
binding inhibition produced by polyamides. 

Figure 28 is a representation of the results of a footprint experiment comparing the 
20 binding of polyamides and AN331 to the HIV-1 promoter region. 

' Figure 29 is a schematic model of the binding of polyamides designed for the 
recognition of DNA sequence adjacent to the CMV TBP binding site and the repressor binding 
site. 

25 

Figure 30 is a representation of the results of a footprint experiment comparing the 
binding of polyamides 1 and 5 and IE86 to the repressor region of the CMV genome. 

Figure 3 1 is a representation of the results of a footprint experiment comparing the 
30 binding of polyamides 3 and 4 and IE86 to the repressor region of the CMV genome. 

Figure 32 is a representation of the results of an experiment comparing the binding of 
polyamide 1 and IE86 on the transcription of CMV RNA. 
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Detailed Description of Preferred Embodiments 

Pyrrole-imidazole polyamides represent the only class of small molecules to date that 
5 can bind any predetermined DNA sequence. DNA recognition depends on side-by side amino 

acid pairings in the minor groove. A pairing of imidazole (Im) opposite pyrrole (Py) targets a 

G-C base pair. A Py/Py combination is degenerate and targets both T-A and A-T base pairs. 

However, Hp/Py is specific for a T-A base pair and Py/Hp is specific for an A-T base pair. 

The generality of these pairing rules has been demonstrated by targeting a variety of sequences 
10 5-13 base pairs in size and is supported directly by NMR structural studies. Eight-ring hairpin 

polyamides have affinities and specificities comparable to DNA-binding transcription factors. 

Solid phase methods for synthesizing the polyamides of the present invention have been 
described by Baird & Dervan (J. Am, Chem, Soc, 118, 6141 (1996)). Alternatively, the 
15 polyamides can also be synthesized via solution phase methods as described by Weiss et al. {J, 
Am, Chem, &c., 79, 1266 (1957)). 

Many protein coding genes utilize both gene- and tissue-specific transcription factors as 
well as general transcription factors for transcription of mRNA by RNA polymerase II. The 

20 binding sites for these protein factors are found in numerous genes, whereas the sequences 
adjacent to these binding sites tend to be unique for each gene. Polyamide ligands can be 
designed which target both the sequences adjacent to the binding sites for these transcription 
factors as well as to the binding sequences for these factors. Polyamides that target these 
sequences will interfere with the binding of the protein factors to DNA and thereby inhibit 

25 transcription by RNA Polymerase n. 

For example, the role of tlie nine zinc finger protein TFIHA in the transcriptional 
regulation of the 5S RNA gene by RNA polymerase HI has been extensively characterized. 
Zinc fingers 1-3, 5 and 7-9 bind tlje intemal control region (ICR) of the gene through base- 
30 specific interactions in the major groove. Fingers 4 and 6 are essential for high affinity DNA 
binding and have been proposed to bind m or across the minor groove (Figure 1). Well 
established methods exist for assessing in living cells the status of RNA polymerase HI 
transcription complexes on the genes encoding the small 5S ribosomal RNA. 
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In another embodiment, polyamides have been designed and synthesized that recognize 
and bind the sequences immediately adjacent to the site at which the minor groove-binding 
protein TATA-box binding protein (TBP) binds to TATA DNA can be designed. DNA 
5 sequences adjacent to the TATA elements are gene-specific whereas TATA elements are found 
in many protein-coding genes. For example, a polyamide boimd to the sequence adjacent to 
the HIV-1 TATA element has been shown to inhibit HIV-1 promoter-specific transcription by 
RNA polymerase 11. A polyamide designed to selectively bind this site would be usefiil for 
treating diseases associated with HIV-1 infection. 

10 

In a third embodiment, a polyamide recognizes and binds the sequence inmiediately 
adjacent to the binding site of lymphoid enhancer factor LEF-1 in.the HIV-1 enhancer. This 
polyamide inhibits the binding to LEF-1 protein to HIV-1 DNA. As above, a polyamide 
designed to selectively bind this site would be useful for treating diseases associated with HTV- 
15 1 infection. 

In another embodiment, a polyamide recognizes and binds to an identifed target 
sequence adjacent to the transcription factor protein binding site of a cellular gene. In one 
preferred embodiment, the cellular gene is a constitutively expressed gene under basal 
20 transcription control. An prefened cellular gene under basal transcription control is the gene 
encoding the 5S ribosomal subunit. 

In yet another preferred embodiment, the minor groove transcription factor protein of 
the cellular gene is TBP. Such preferred cellular genes include oncogenes such as LEF-1, Ets- 
25 1 and her-2/neu. Other such preferred cellular genes include genes encoding cytokines such as 
interleukins, including IL-2, IL-5 and IL-13, tumor necrosis factors, including TNF-alpha and 
TNF-beta, growth factors, including TGF-beta, and colony stimulating factors, including GM- 
CSF. 

30 Using the above described rules, a sequence-specific polyamide can be designed that 

selectively binds to an identified target site adjacent to the binding site of a minor groove 
transcription factor protein. As used herein, "adjacent" includes 1) polyamide binding sites 
wherein an end nucleotide base pair of tlie polyamide binding site is immediately continguous 
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to an end nucleotide of the minor groove transcription factor protein binding site, 2) polyamide 
binding sites wherein one to five nucleotide base pairs of the polyamide binding site are shared 
with the binding sites of the minor groove transcription factor protein, and 3) polyamide 
binding sites wherein the polyamide binding site is separated j&om the minor groove 
5 transcription factor protein binding site by from one to four intervening nucleotide base pairs. 
The binding affinity of such a designed polyamide should be greater than the binding affinity 
of the native transcriptional element in order to inhibit transcription. The binding affinity can 
be ascertained by competitive inliibition experiments against a native transcription factor. 

10 Within this application, unless otherwise stated, definitions of the terms and illustration 

of the techniques of this application may be found in any of several well-known references 
such as: Sambrook, J., et al. Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory Press (1989); Goeddel, D., ed.. Gene Expression Technology, Methods in 
Enzymology, 185, Academic Press, San Diego, CA (1991); "Guide to Protein Purification" in 

15 Deutshcer, M.P., ed., Methods in Enzymology, Academic Press, San Diego, CA (1989); Innis, 
et al, PCR Protocols: A Guide to Methods and AppUcations, Academic Press, San Diego, CA 
(1990); Freshney, R.L, Culture of Animal Cells: A Manual of Basic Technique, 2nd Ed., Alan 
Liss, Inc. New York, NY (1987); Murray, E.J., ed., Gene Transfer and Expression Protocols, 
pp. 109-128, The Humana Press Inc., Clifton, NJ and Lewin, B., Genes VI, Oxford University 

20 Press, New York (1 997). 

For the purposes of tliis application, a promoter is a regulatory sequence of DNA that is 
involved in the binding of RNA polymerase to initiate transcription of a gene. A gene is a 
segment of DNA involved in producing a peptide, polypeptide or protein, including the coding 

25 region, non-coding regions preceding ("leader") and foUovnng C*trailer") the coding region, as 
well as intervening non-coding sequences ("introns") between individual coding segments 
("exons"). Coding refers to the representation of amino acids, start and stop signals in a three 
base "triplet" code. Promoters are often upstream (" 5' to") the transcription initiation site of 
the corresponding gene. Other regulatory sequences of DNA in addition to promoters are 

30 known, including sequences involved with the binding of transcription factors, including 
response elements that are the DNA sequences bound by inducible factors. Enhancers comprise 
yet another group of regulatory sequences of DNA that can increase the utilization of 
promoters, and can function in either orientation (5 '-3' or 3 '-5') and in any location (upstream 
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or downstream) relative to the promoter. Preferably, the regulatory sequence has a positive 
activity, i.e., binding of an endogeneous Hgand (e.g. a transcription factor) to the regulatory 
sequence increases transcription, thereby resulting in increased expression of the correqjonding 
target gene. In such a case, interference with transcription by binding a polyamide to a 
5 regulatory sequence would reduce or abolish expression of a gene. 

The promoter may also include or be adjacent to a regulatory sequence known in the art 
as a silencer. A silencer sequence generally has a negative regulatory effect on expression of 
the gene. In such a case, expression of a gene may be increased directly by using a polyamide 
10 to prevent binding of a factor to a silencer regulatory sequence or indirectly, by using a 
polyamide to block transcription of a factor to a silencer regulatory sequence. 

It is to be understood that the polyamides of this invention bind to double stranded 
DNA in a sequence specific manner. The fimction of a segment of DN A of a given sequence, 

15 such as 5'-TATAAA-3', depends on its position relative to other functional regions in the 
DNA sequence. In this case, if the sequence 5'-TATAAA-3' on the sense strand of DNA is 
positioned about 30 base pairs upstream of the transcription start site, the sequence forms part 
of the promoter region (Lewin, Genes VI, pp. 831-835). On the other hand, if the sequence 5'- 
TATAAA-3' is downstream of the transcription start site in a coding region and in proper 

20 register with the reading frame, the sequence encodes the tyrosyl and lysyl amino acid residues 
(Lewin, Genes VI, pp. 213-215). 

While not being held to one hypothesis, it is believed that the binding of the polyamides 
of this invention modulate gene expression by altering the binding of DNA binding proteins, 
25 such as RNA polymerase, transcription factors, TBP, TFIILA, the lymphoid enhancer factor 
protein LEF-1, and other minor groove transcription factor proteins. The efifect on gene 
expression of polyamide binding to a segment of double stranded DNA is believed to be 
related to the function, e.g., promoter, of that segment of DNA. 

30 It is to be understood by oae skilled in the art that the improved polyamides of the 

present invention may bind to any of the above-described DNA sequences or any other 
sequence having a desired effect upon expression of a gene. In addition, U.S. Patent No. 
5,578,444 in Table I lists numerous pathogens in which are foxmd medically significant target 
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sequences for DNA-binding drugs and in Table II lists many non-infectious diseases that may 
be controlled at the level of DNA binding proteins. 

It is generally understood by those skilled in the art that the basic structure of DNA in a 
5 living cell includes both major and a minor groove. For the purposes of describing the present 
invention, the minor groove is the narrow groove of DNA as illustrated in common molecular 
biology references such as Lewin, B., Genes VI, Oxford University Press, New York (1997). 

To affect gene expression in a cell, which may include causing an increase or a 
10 decrease in gene expression, a quantity of one or more poly amides effective to inhibit 
transcription is contacted with the cell and internalized by the cell. The cell may be contacted 
by the polyamide in vivo or in vitro. Effective transcription inhibiting extracellular 
concentrations of polyamides that can modulate gene expression range from about 10 
nanomolar to about 1 micromolar. Gottesfeld, J.M., et al. Nature 387 202-205 (1997). To 
15 determine effective amounts and concentrations of polyamides in vitro, a suitable number of 
cells is plated on tissue culture plates and various quantities of one or more polyamide are 
added to separate wells. Gene expression following exposure to a polyamide can be monitored 
in the cells or in the medium by detecting the amount of the protein gene product present as 
determined by various tecliniques utilizing specific antibodies, including ELISA and western 
20 blot. Alternatively, gene expression following exposure to a polyamide can be monitored by 
detecting the amount of messenger RNA present as determined by various techniques, 
including northern blot and RT-PCR. 

Similarly, to detemiine effective amoxmts and concentrations of polyamides for in vivo 
25 administration, a sample of body tissue or fluid, such as plasma, blood, urine, cerebrospinal 
fluid, saliva, or biopsy of skin, muscle, liver, brain or other appropriate tissue source is 
analyzed. Gene expression following exposure to a polyamide can be monitored by detecting 
the amount of tlie protein gene product present as determined by various techniques utilizing 
specific antibodies, including ELISA and western blot. Alternatively, gene expression 
30 following exposure to a polyamide can be monitored by the detecting the amount of messenger 
RNA present as determined by vai*ious techniques, including northern blot and RT-PCR. 
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The polyamides of this invention may be formulated into diagnostic and therapeutic 
compositions for in vivo or in vitro use. Representative methods of formulation may be foimd 
in Remington: Tlie Science and Practice of Pharmacy, 19th ed., Mack PubUshing Co., Easton, 
PA (1995), 

5 

For in vivo use, the polyamides may be incorporated into a physiologically acceptable 
pharmaceutical composition that is administered to a patient in need of treatment or an animal 
for medical or research purposes. The polyamide composition comprises pharmaceutically 
acceptable carriers, excipients, adjuvants, stabilizers, and vehicles. The composition may be in 
10 solid, liquid, gel, or aerosol form. The polyamide composition of the present invention may be 
administered in various dosage forms orally, parentally, by inhalation spray, rectally, or 
topically. The temi parenteral as used herein includes, subcutaneous, intravenous, 
intramuscular, intrastemal, infusion teclmiques or intraperitoneally. 

15 The selection of the precise concentration, composition, and deUvery regimen is 

influenced by, inter alia, the specific pharmacological properties of the particular selected 
compound, the intended use, the nature and severity of the condition being treated or 
diagnosed, the age, weight, gender, physical condition and mental acuity of the intended 
recipient as well as the route of administration. Such considerations are within the purview of 

20 the skilled artisan. Thus, the dosage regimen may vary widely, but can be determined routinely 
using standard methods. 

Polyamides of the present invention are also useful for detecting the presence of double 
stranded DNA of a specific sequence for diagnostic or preparative purposes. The sample 
25 containing the double stranded DNA can be contacted by polyamide linked to a solid substrate, 
thereby isolating DNA comprising a desired sequence. Altematively, polyamides linked to a 
suitable detectable marker, such as biotin, a hapten, a radioisotope or a dye molecule, can be 
contacted by a sample containing double stranded DNA. 

30 The design of bifimctional sequence specific DNA binding molecules requires the 

integration of two sepaiate entities: recognition and functional activity. Polyamides that 
specifically bind with subnanomolar affinity to the minor groove of a predetermined sequence 
of double stranded DNA are linl<ed to a functional molecule, providing the corresponding 
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bifiinctional conjugates useful in molecular biology, genomic sequencing, and human 
medicine. Polyamides of this invention can be conjugated to a variety of functional 
molecules, which can be independently chosen from but is not limited to arylboronic acids, 
biotins, polyhistidines comprised from about 2 to 8 amino acids, haptens to which an antibody 

5 binds, solid phase supports, oligodeoxynucleotides, N-ethybiitrosourea, fluorescein, 
bromoacetamide, iodoacetamide, DL-a-lipoic acid, acridine, captothesin, pyrene, mitomycin, 
texas red, anthracene, anthrinilic acid, avidin, DAPI, isosulfan blue, malachite green, ethyl red, 
4-(psoraen-8-yloxy)-butyrate, tartaric acid, (+)-a-tocopheral, psoralen, EDTA, methidium, 
acridine, Ni(II)»Gly-Gly-His, TO, Dansyl, pyrene, N-bromoacetamide, and gold particles. Such 

10 bifunctional polyamides are useful for DNA affinity capture, covalent DNA modification, 
oxidative DNA cleavage, and DNA photocleavage. Such bifimctional polyamides are usefiil 
for DNA detection by providing a polyamide Unked to a detectable label. Detailed 
instructions for synthesis of such bifianctional polyamides can be foimd in copending U.S. 
provisional application 60/043,444, the teachings of which are incorporated by reference. 

15 

DNA complexed to a labeled polyamide can then be determined using the appropriate 
detection system as is well known to one skilled in the art. For example, DNA associated with 
a polyamide linked to biotin can be detected by a streptavidin / alkaline phosphatase system. 

20 The present invention also describes a diagnostic system, preferably in kit form, for 

assaying for the presence of the double stranded DNA sequence bound by the polyamide of 
this invention in a body sample, such brain tissue, cell suspensions or tissue sections, or body 
fluid samples such as CSF, blood, plasma or serum, where it is desirable to detect the presence, 
and preferably the amount, of the double stranded DNA sequence bound by the polyamide in 

25 the sample according to the diagnostic methods described herein. 

The diagnostic system includes, in an amount sufficient to perform at least one assay, a 
specific polyamide as a separately packaged reagent. Instructions for use of the packaged 
reagent(s) are also typically included. As used herein, the terai "package" refers to a 
30 solid matrix or material such as glass, plastic (e.g., polyethylene, polypropylene or 
polycarbonate), paper, foil and the like capable of holding within fixed limits a polyamide of 
the present invention. Thus, for example, a package can be a glass vial used to contain 
milligram quantities of a contemplated polyamide or it can be a microtiter plate well to which 
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microgram quantities of a contemplated polyamide have been operatively affixed, i.e., linked 
so as to be capable of being bound by tlie target DNA sequence. "Instructions for use" typically 
include a tangible expression describing the reagent concentration or at least one assay method 
parameter such as the relative amounts of reagent and sample to be admixed, maintenance time 

5 periods for reagent or sample admixtxires, temperature, buffer conditions and the Uke. A 
diagnostic system of the present invention preferably also includes a detectable label and a 
detecting or indicating means capable of signaling the binding of the contemplated polyamide 
of the present invention to the target DNA sequence. As noted above, numerous detectable 
labels, such as biotin, and detecting or indicating means, sucli as enzyme-linked (direct or 

10 indirect) streptavidin, are well known in the art. 

Example 1: 
Transcription inhibition in vitro and in vivo. 

15 

A high speed cytosolic extract from unfertilized Xenopus eggs was prepared as 
described in Hai1l, et al., J. Cell Biol 120, 613-624 (1993) . DNA templates for transcription 
were the somatic-type 5S RNA gene contained in plasmid pXlsll (50 ng per reaction; 
Peterson et al. Cell 20, 131, 1980) and the tyrD tRNA gene contained in plasmid pTyrD (100 

20 ng per reaction; Stutz, et al, Genes Dev. 3, 1190, 1989) both from Xenopus laevis. 
Transcription reactions (20 |iL final volume) contained the following components: 2.5 \iL 
extract, 9 ng (12 nM) of TFHIA isolated from immature oocytes (Smith et al. Cell 37, 645, 
1984), 0.6 mM ATP, UTP, CTP, 0.02 mM GTP and 10 ^Ci of [a- ^^P] GTP, and the final 
buffer components 12 mM HEPES (pH 7.5), 60 mM KCI, 6 mM MgCh, 25 jiM ZnCl2, and 8% 

25 (vA^) glycerol. Plasmid DNAs were pre-incubated with polyamides in the same buffer prior to 
adding TFIIIA and other reaction components. RNA was purified and analyzed on a 
denaturing 6% polyacrylamide gel. A Molecular Dynamics Phosphorimager equipped with 
ImageQuant software was used to quantify the ejBFects of the polyamides on the relative 
transcription efficiencies of the 5S and TRNA genes. 

30 

Fibroblasts from a Xenoptts kidney derived cell line (kindly provided by Dr. P. Labhart, 
The Scripps Research Institute) were grown at ambient temperature in 25 cm^ culture flasks in 
Dulbecco's modified Eagle medium containing 10% (v/v) fetal calf serum. Cells were 
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passaged for a minimum of three days prior to the addition of polyamide to the culture 
medium. Incubations were continued for various times and nuclei were prepared by hypotonic 
lysis and used as templates for transcription as described. DNA content was determined by 
measuring the absorbance of an aliquot of the isolated nuclei in 1 % (w/v) sodium dodecyl 
5 sulfate (using an extinction coefficient at 260 nM of 1 AU =50 ^ig/mL DNA). The buffer 
components and labeled and unlabeled nucleoside triphosphates were as for the plasmid 
transcription reactions. Reactions were supplemented with 2 jiL of RNA polymerase in (at 
approximately 50 ^ig/mL) isolated fvom Xenopus oocytes. 

10 Footprinting experiments reveal 90% inhibition of TFIIIA binding in the presence of 5 

nM polyamide. After incubation with 60 nM polyamide in a cell-free extract derived from 
Xenopus oocytes, the transcriptional activity of a 5S gene is inhibited by >80%. When 100 nM 
polyamide is supplied in culture medium containing Xenopus kidney cells, transcription 
complexes on the 5S RNA genes are selectively disrq)ted. These results demonstrate that 

15 pyrrole-imidazole polyamides are cell permeable and can inhibit the transcription of a specific 
gene in living cells. 

Polyamides were synthesized by solid phase methods as described in Baird, E. E. & 
Dervan, P. B. J. Am. Chem, Soa 118, 6141-6146 (1996). The identity and purity of the. 
20 polyamides was verified by NMR, matrix-assisted laser desorption/ionization time of flight 
mass spectrometry (MALDI-TOF-MS), and analytical HPLC. MALDI-TOF-MS: 1, 1223.4 
(1223.3 calculated for M+H); 2, 1222.3 (1222.3 calculated for M+H); 3, 1223.1 (1223.3 
calculated for M+H). 

25 

Tlie eight-ring polyamide (labelled 1 in Figure lb) having the sequence composition 
ImPyPyPy-y-ImPyPyPy-P-Dp was synthesized by solid phase methods and shown to bind the 
six base pair site 5'-AGTACT-3' at subnanomolar concentration (Figure 1). This sequence is 
within the binding site for zinc finger 4 of TFIIIA. Quantitative DNAse I footprint titration 
30 experiments reveal that polyamide 1 of Figure lb selectively binds the six base pair target 
sequence with a dissociation constant, K<j = 0.03 nM, a higher affinity than TFIIIA for its 50 
base pair site (Kj - 1 nM). For controls, mismatch eight-ring polyamides ImPyPyPy-y- 
PyPyPyPy-p-Dp (labelled 2 in Figure lb) and ImPylmPy-y-PyPyPyPy-P-Dp (labelled 3 in 
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Figure lb) were prepared which have 100- fold (IQ =2.0 nM) and 1000-fold (IQ =33 nM) lower 
affinities, respectively, for the 5'-AGTACT-3'site. The models of the bmding of the 
polyamides to the identified target sequences of double stranded DNA are illustrated in Figure 
Ic. 

5 

The left side of Figure la shows a schematic model for the interaction of the nine 
zinc finger protein TFIUA with the 5S ribosomal RNA gene internal control region (ICR): 
The middle section of Figure la shows the sequence of the ICR recognized by finger 4 in 
the minor groove. The six base pair region targeted by the designed "hairpin" polyamide is 
10 enclosed in a rectangle. The right section of Figure la shows the expected complex of 
LnPyPyPy-Y-ImPyPyPy-p-Dp (polyamide 1) with its identified target DNA sequence, 5 - 
AGTACT-3*. Circles witli dots represent lone electron pairs on N3 of piuines and 02 of 
pyrimidines. Circles containing an H represent the N2 hydrogen of guanine. Putative 
hydrogen bonds are illustrated by dashed lines. 

15 

Figure lb shows the structures of polyamides ImPyPyPy-y-ImPyPyPy-P-Dp (1), 
ImPyPyPy-y-PyPyPyPy-P-Dp (2), and ImPylmPy-y-PyPyPyPy-p-Dp (3). The models of 
binding of these polyamides to the target duplex DNA sequence are shown in Figure Ic. 
The filled and empty circles represent imidazole and pyrrole rings, respectively, the curved 
20 line represents y-aminobutyric acid (y), and the diamond represents p-alanine. Hydrogen 
bond mismatches such as G-Py and A-Im are highlighted. 

The effect of polyamide labelled 1 in Figure .lb (ImPyPyPy-y-ImPyPyPy-p-Dp) on 
TFniA binding to a restriction fragment isolated from a 5S RNA geiie-containing plasmid was 

25 examined. Zfl-3, a recombinant TFIIIA analog missing fingers 4-9, binds in the major groove 
of the C-block promoter element (see Figure la). DNase I footprinting demonstrates that zfi-3 
and polyamide 1 can co-occupy tlie same DNA molecule. When 5 nM polyamide 1 was 
preincubated with the same DNA target, the binding of nine finger TFIIIA was inhibited by 
>90% (Figure 2). Figure 2 shows the result of DNase I footprinting analysis of the binding of 

30 polyamide 1 and TFIIIA to the 5S RNA gene ICR. 5 -end-labeled restriction Augments were 
derived from the 5S RNA gene by standard methods and footprinting reactions were as 
described in Clemens, K.R., et al., J. Mol Biol 244 23-35 (1994). Lane 1 shows a DMS G- 
specific sequencing ladder, lane 2, protein-free DNA, lanes 3-5, digestion products obtained in 
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the presence of 0.5 nM, I nM and 2 nM TFIIIA, respectively, lane 6, digestion products 
obtained in the presence of 5 nM polyamide 1. Lanes 7-9 show the digestion products of 
reactions in which die DNA was incubated with 1 for 30 min prior to the addition of TFIIIA 
(at the same concentrations as in lanes 3-5, respectively) and incubated for an additional 30 
5 min prior to DNase treatment. The differential inhibition of zfl-3 and full-length TFIIIA 
provides evidence that finger 4 interacts with or is placed in the minor groove. Polyamide 1 
does not inliibit TFIIIA binding to 5S RNA. 

Transcription of the 5S RNA gene in an in vitro system was monitored in the presence 
10 of increasing concentrations (10-60 nM) of polyamide 1. The results are shown in Figure 3. In 
these experiments, polyamide 1 was added to a 5S RNA gene containing plasmid prior to the 
addition of exogenous TFIIIA (12 nM) and a crude extract derived from unfertilized Xenopus 
eggs. As a control, a tyrosine tRNA gene was included on a separate plasmid in these 
reactions. The tRNA gene has an upstream binding site for polyamide 1, but lacks a predicted 
15 protein-DNA interaction. Both genes are actively transcribed in this system, either individually 
(lanes 1 and 15) or in mixed template reactions (lane 2). Addition of 60 nM polyamide 1 
inhibits 5S gene transcription by >80% (lane 6). Only a small degree of non-specific inhibition 
of tRNA transcription is observed at the concentrations of polyamide 1 required for efficient 5S 
RNA inhibition. The targeted 5S RNA gene is inhibited approximately 10-fold more 
20 effectively than the control tRNA gene. 

In these experiments the DNA templates were incubated with polyamide for 30 min 
prior to the addition of TFIIIA, a cytoplasmic extract prepared fix)m imfertilized Xenopus eggs, 
and labeled and unlabeled nucleoside triphosphates. Reactions contained either the somatic- 

25 type 5S RNA gene (Figure 3a, lane 1), a tRNA*^^ gene (lane 15), or a mixture of both genes 
(lanes 2-14). Reactions contained the following final concentrations of the indicated 
polyamide: 0 nM, lanes 1, 2, 15; 10 nM, lanes 3, 7, 11; 20 nM, lanes 4, 8, 12; 40 nM, lanes 5, 
9, 13; 60 nM, lanes 6, 10, 14. Transcription reactions were stopped after 1 hour at ambient 
room temperature and the RNA products were analyzed on a denaturing polyacrylamide gel. 

30 The positions of 5S RNA and the tRNA primary and processed transcripts are indicated at the 
right of the autoradiogram. In Figure 3b the results are represented grq)hically, with inhibition 
results expressed as the ratio of 5S RNA to tRNA transcription relative to the ratio obtained in 
the absence of polyamide. The error bars represent estimated standard deviations. 
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Mismatch polyamides ImPyPyPy-y-PyPyPyPy-p-Dp (2) and ImPylmPy-y-PyPyPyPy- 
p-Dp (3) did not inhibit 5S RNA transcription at concentrations up to 60 nM. If the TFIIIA- 
DNA complex is first allowed to form, 30 nM polyamide 1 added, and the mixture incubated 
5 for 90 minutes prior to adding egg extract, efficient inhibition (80%) of 5S RNA transcription 
is also observed. Shorter incubation times result in less inhibition. The required incubation 
time of 90 minutes is similar to the measured half-life of the TFIHA-DNA complex and 
supports that polyamide 1 forms a more stable complex with DNA than does TF TTTA . 

10 The effect of the polyamides on 5S gene transcription in vivo was determined (Figure 

4). Xenopus kidney-derived fibroblasts were grown in the presence of increasing 
concentrations of polyamide 1 in the culture medium for various times. Concentrations of 
polyamide up to 1 |iM were not toxic, as measured by cell density, if growth was limited to 
less than 72 hours. Nuclei were prepared from cells by hypotonic lysis and equivalent amounts 

15 of the isolated nuclei from control and treated cells were used as templates for transcription 
with exogenous RNA polymerase III and labeled and unlabeled nucleoside triphosphates. This 
experiment monitors the occupancy of class III genes with active transcription complexes. 5S 
RNA transcription can easily be assessed since the repetitive 5S genes give rise to a prominent 
band on a denaturing polyacrylamide gel. An autoradiogram was taken of the gel and the 

20 following observations made based on the observed autoradiogram. 

Nuclei were prepared from Xenopus kidney-derived fibroblasts that were grown in 
culture in the presence or absence of polyamides. Polyamides were included in culture 
medium (in 2.5 mL of media per 25 cm^ flask) for 24 hours prior to harvesting cells and 

25 isolation of nuclei. Equal amounts of nuclei (containing 5 (ig of DNA) were incubated for 2 
hours in 20 |.tL reactions containing Xenopus RNA polymerase in and labeled and unlabeled 
nucleoside triphosphates. RNA was isolated from these reactions and analyzed on a denaturing 
polyacrylamide gel. The results are shown in Figure 4a in which lane 1 is a 5S RNA marker; 
lane 12 is a tRNA*^^ gene marker; lane 2, cells not exposed to polyamide; lanes 3-11, cells 

30 exposed to the indicated polyamides 1, 2, or 3 at the following concentrations; 100 nM, lanes 3, 
6, 9; 300 nM, lanes 4, 7, 10; 1 |j,M, lanes 5» 8, 1 1. Figure 4b is a graphic representation of the 
effects of polyamides 1, 2 and 3 on 5S RNA and tRNA transcription expressed as the ratio of 
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5S RNA to tRNA gene transcription relative to the ratio obtained in the absence of polyamide. 
Error bars represent estimated standard deviations. 

Concentrations of polyamide 1 as low as 100 nM have a pronomiced and selective 
5 effect on 5S transcription. At higher polyamide concentration, a general decrease in the 
transcriptional activity of the nuclei is observed. However, phosphorimage analysis of this 
experiment reveals that at each concentration tested, the effects of the polyamide are much 
greater on 5S RNA transcrption than on tRNA transcription. Having established that nearly 
maximal inhibition of 5S transcription is achieved with 1 \xM polyamide 1, nuclear 
10 transcription was monitored after various times of cell growth in the presence of the 
polyamide. No inhibition is observed for zero time incubation with polyamide 1 at 1 jiM 
concentration, indicating that disruption of transcription complexes does not occur during or 
after tlie isolation or work-up of cell nuclei. Statistically equivalent levels of 5S transcription 
were observed when the cells were exposed to polyamide 1 for 24, 48 or 72 hours. 

15 

These nuclear transcription experiments indicate that polyamide 1 is able to enter cells, 
transit to the nucleus and disrupt transcription complexes on the chromosomal 5S RNA genes. 
To rule out the possibility that tlie observed inhibition is due to some non-specific toxicity of 
the polyamide rather than to direct binding to the 5S RNA gene, the effects of mismatch 
20 polyamides 2 and 3 in the nucleai* transcription assay were monitored. Only a small effect on 
5S RNA synthesis relative to tRNA synthesis is observed with 1 of the mismatch 
polyamides 2 or 3 in the culture medium for 24 hours. This result indicates that the general 
inhibition of transcription obsei*ved with high concentrations of polyamide 1 may be a 
secondary effect of the inhibition of 5S RNA synthesis in vivo, rather than the result of non- 
25 specific polyamide interactions. Polyamide 2 affects a small enhancement of 5S RNA 
transcription in vitro and in vivo, indicating that polyamides may be able to upregulate 
transcription in certain cases. 
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Example 2: 

Design and Synthesis of Optimized Polyamides 
Targeted to HIV-1 Promoter Sequences 

5 The experiments of Example 1 showed that an eight-ring "4-y-4 motif polyamide 

having the sequence composition ImPyPyPy-y-ImPyPyPy-P-Dp binds specifically to the six 
base pair site 5'-AGTACT-3'. It has also been shown recently that the 10-ring, "S-y-S motif 
polyamide ImPyPyPyPy-y-ImPyPyPyPy-p-Dp specifically binds a seven base pair 5'- 
WGWWWCW-3' target sequence with subnanomolar affinity in a hairpin conformation 

10 (Trauger, J.W., et al, J, Am. Chem, SocM^ 6160-6166, 1996). Applying the polyamide 
pairing rules to the 5-y-5 molecular template suggested that the 10-ring hairpin polyamide 
ImPyPylmPy-y-ImPyPylmPy-P-Dp would bind the HIV-1 target sequences 5'-WGCWGCW- 
3*. However, as reported below, polyamide polyamide ImPyPylmPy-y-ImPyPylmPy-p-Dp 
specifically binds the target site 5'-TGCTGCA-3', but with relatively low affinity, 

15 necessitating development of a second-generation polyamide. 

Design of Second Generation Polyamide 4. The simple amino acid p-alanine has 
been employed effectively as a single base pair-spanning linker in many different polyamides. 
In particular, in many cases in which a polyamide binds with relatively low affinity due to an 

20 apparent register mismatch between polyamide residues and their target DNA bases, 
substitution of one or more pyrrole residues with the flexible spacer p-alanine results in marked 
increases in binding affinity. These results suggested that the 8-ring polyamide LnPy-p-ImPy- 
y-ImPy-P-ImPy-P-Dp (4), which differs from 1 only by the replacement of two pyrrole residues 
with P-alanines, would specifically bind the HIV-1 target sequences 5'-WGCWGCW-3' with 

25 high affinity using a novel 2-P-2-y-2-p-2 motif Equilibrium association constants were 
determined by quantitative DNase I footprint titration experhnents of the formally matched 
(according to the polyamide pairing rules) polyamides 1 and 4 and the formally mismatched 
polyamides 2, 3, 5, and 6 for the HIV-1 target sequence 5'-TGCTGCA-3' (Figures 5b and 6). 
To further demonstrate the general usefulness of the 2-p-2-y-2-p-2 motif, the binding of 2 and 

30 5, and 3 and 6 to their respective fomial match sequences 5'-TGGTGGA-3' and 5'- 

TGTTACA-3' was examined. MPE»Fe^ footprinting and affinity cleavage studies were 
prefonned for a subset of these compounds. Finally, the equilibrium association constants of 
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polyamide 4 for the fflV-1 target sequence 5'-TGCTGCA-3' were determined at either 24 ''C 
or 37 °C, and using either standard polyamide assay solution conditions or approximate 
intracellular solution conditions. 

5 NMR spectra were recorded on a GE 300 instrument operating at 300MHz (^H). 

Spectra were recorded in DMS0-r/(5 with chemical shifts reported in parts per million relative 
to residual DMSO-<ij. Polyamide concentrations were determined by UV absorbance 
measurements made on a Hewlett-Packard Model 8452A diode array spectrophotometer. 
Matrix-assisted, laser desorption/ionization time of flight mass spectrometry was carried out at 

10 the Protein and Peptide Microanalytical Facility at the California Institute of Technology. 
Analytical HPLC was performed on a HP 1090M analytical HPLC using a Rainen CI 8, 
Microsorb MV, 5 m, 300 x 4.6 mm reversed phase column in 0.1% (wt/v) TFA with 
acetonitrile as eluent and a flow rate of 1.0 ml/min, gradient elution 1.25% acetonitrile/min. 
Preparative HPLC was carried out on a Beckman instrument using a Waters DeltaPak 25 x 100 

15 mm, 100 m Ci8 column in 0.1% (wt/v) TFA, gradient elution 0,25%/min. CH3CN. E. coli 
XL-1 Blue competent cells were obtained from Stratagene. Restriction endonucleases were 
purchased from Boeringher-Mannheim or New England Biolabs. Plasmid isolation kits were 
obtained from Promega. Plasmid sequencing was carried at the Sequence Analysis Facility at 
the California Institute of Technology. Sequenase (version 2.0) was obtained fiom United 

20 States Biochemical, and DNase I (FPLCpure) was obtained from Pharmacia, [a- 

32 

Thymidine-5*-triphosphate (^000 Cf/mmol), [a- P]-deoxyadenosine-5*-triphosphate (> 6000 

32 

Ci/mmol), and [a- P]-adenosine-5*-triphosphate were purchased from Du Pont/NEN. Water 
was obtained from a Millipore Milli-Q water purification system. 

Synthesis. Polyamides were prepaied by stepwise solid-phase synthesis using BOC- 
25 protected monomers as described by Baird, E.E.; Dervan, P.B. J. Am. Chem. Soc, 1996, 7i5, 
6141-6146. 

Plasmids. Plasmid pJT2B2 was prepared by hybridizing the complementary 
ohgonucleotides, 

30 5'-CCGGCTTAAG TTCGTGGGCC ATGCTGCATT CGTGGGCCAT GGTGGATTCG 
TGGGCCATGT TACATTCG-3' and 
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5»-TCGACGAATGTAACATGGCC CACGAATCCA CCATGGCCCA CGAATGCAGC - 
ATGGCCCACG AACTTAAG-3', and ligating the resulting duplexes to the large pUC19 Ava 
ySal 1 restriction fragment. 

5 The plasmid was transformed into ExolL and plasmid DNA isolated using standard 

32 

methods, and the sequence of the insert confirmed by sequencing. The 3*- P end-labeled Eco 

RUPvu II restriction fragment was prepared by simultaneous digestion with Eco RI and Pvu II 

32 32 
and 3'-fill-in using Sequenase, [a- P]-deoxyadenosine-5*-triphosphate, and [a- ?]- 

thymidine-5 -triphosphate. Tlie 282 bp fragment was isolated by nondenaturing gel 

32 

10 electrophoresis. The 5'- P-end-labeled EcoRUPvuU fragment was prepared using standard 
methods. See Sambrook, J.; Fritsch, E.F.; Maniatis, T. Molecular Cloning; Cold Spring 
Harbor Laboratory: Cold Spring Harbor, NY, 1989. 

A-specific chonical sequencing was carried out as described by Iverson, BX.; Dervan, 
P.B. Nucleic Acids Res. 1987, 75, 7823-7830. Standard methods as described by Sambrook, et 

15 al. (1989) were used for all DNA manipulations. 

Quantitative DNase I Footprinting. Equilibriimi association constants for polyamide- 
DNA complexes were determined by quantitative DNase I footprint titration experiments. 
Reactions were carried out in a total volume of 400 |iL in the absence of carrier DNA. A 

20 polyamide stock solution or H2O (for reference lanes) was added to a solution containing 3'- 
end-labeled restriction fragment (15,000 cpni) affordmg final solution conditions of either 10 
mM Tris'HCl, 10 mM KCl, 10 mM MgCl2, 5 mM CaCl2, pH 7.0. at 24 °C, or 10 mM 
HEPES^HCl, 140 mM KCl, 10 mM NaCl, 1 niM MgCl2, 1 mM spermine, pH 7.2 as indicated 
in Tables 1 and 2. The solutions were allowed to equilibrate at 24 °C or 37 °C as indicated in 

25 Tables 1 and 2 for 12-16 hr. Cleavage reactions were initiated by the addition of 10 nL of a 
DNase I stock solution (at the appropriate concentration to give -55% intact DNA) containing 
1 mM dithiothreitol, allowed to proceed for 7 min. at 24 °C, or 3.5 min. at 37 °C, and stopped 
by the addition of 50 \iL of a solution containing 2.25 M NaCl, 150 mM EDTA, 0.6 mg/mL 
glycogen, and 30 |iM base-pair calf diynius DNA, and precipitated with 1 mL ethanol. 

30 Reactions were then resuspended in Ix TBE/80% formamide, heated at 85 °C for 10 min. 
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placed on ice, aiid the reaction products separated by electrophoresis on an 8% polyacrylamide 
gel (5% cross-link, 7 M urea) in Ix TBE at 2000 V. Gels were dried, exposed to a storage 
phosphor screen (Molecular Dynamics), and imaged using a Molecular Dynamics 400S 
Phosphorlmager. Data was obtained from the imaged gels by quantitation using ImageQuant 
5 software (Molecular Dynamics). Background-corrected volume integration of rectangles 
encompassing the footprint sites and a reference site at which DNase I reactivity was invariant 
across the titration generated values for the site intensities (Igite) reference intensity (I- 

j^f). The apparent fractional occupancy (Oapp) of the sites was then calculated using the 

equation: 

0 1 site /Iref 
I site /Iref 

where I^gite I^ref ^® ^^^^ reference intensities, respectively, from a control lane to 
which no polyaniide was added. The ([L]jq^, 6app) data points were fit to a general Hill 
equation (eq. 2) by minimizing the difference between Gapp and Gfit: 

Qfit = 0min + (0max - Qmin ) ^— ^ (2) 

/ ^ 1 + Ka"[L]"tot ^ ^ 

15 where [L]^q^ is the total polyamide concentration, Ka is the equilibrium association constant, 
Gmin and Gj-nax are the experimentally determined site saturation values when the site is 
imoccupied or saturated, respectively. Tlie data were fit using a nonlinear least-squares fitting 
procedure with Ka, Gmax, and Gmin as the adjustable parameters, and with a fixed value for n. 
For hairpin polyamides 1-6, binding isotheniis were adequately fit by Langmuir isotherms (eq. 

20 2, n=l), consistent with formation of a 1:1 polyamide-DNA complexes. For unlinked 
polyamides 7-9 binding to their match sequences, binding isotherms were adequately fit by a 
cooperative isotherm (eq. 2, n=2) consistent with cooperative dimeric binding. Reported 
association constants are the average value obtained from at least three independent 
footprinting experiments. 

TT 

25 MPE'Fe Footprinting and Affinity Cleavage Experiments. Exact bindmg sites 

were determined by MPE»Fe^^ footprinting. Binding orientation was probed using affinity 
cleavage experiments. All reactions were carried out in a total volume of 400 (iL. A stock 

solution of polymiiide (for MPE^Fe^ footprinting), polyamide-EDTA (for affinity cleavage), 
or H2O (for reference lanes) was added to a solution containing 3'- or 5 '-end-labeled 
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restriction fragment (15,000 cpm) affording final solution conditions of 20 mM HEPES, 200 
mM NaCl, 50 |ag/mL glycogen, and pH 7.3, and the solution allowed to equilibrate for 12 hr at 

24 ""C. Next, for MPE^Fe^ footprinting reactions, MPE^Fe^ was added to a final 
concentration of 0.5 ^M and the solution allowed to equilibrate for 10 min. (a 5 

5 MPE^FE^ stock solution was prepared by mixing equal volumes of 10 [iM MPE and freshly 
prepared 10 |iM Fe(NH4)2(S04)2), and for affinity cleavage reactions, freshly prepared 
Fe(NH4)2(S04)2 was added to a final concentration of 1 \iM and the solution allowed to 

equilibrate for 30 min. For both MPE»Fe^^ footprinting and affinity cleavage, cleavage was 
initiated by the addition of dithiothreitol to a final concentration of 5 mM and allowed to 

10 proceed for 30 min at 24 "^C, then stopped by adding 1 mL ethanol. Next, 10 of a solution 
containing calf thymus DNA (140 |xM base-pair) (Pharmacia) and glycogen (Boehringer- 
Mannheim) (2.8 mg/mL) was added, and the DNA precipitated. The reactions were 
resuspended in IX TBE/80% formamide, heated at 85 °C for 10 min, placed on ice, and the 
reaction products separated by electrophoresis on an 8% polyacrylamide gel (5% cross-link, 7 

15 M urea) in IX TBE at 2000 V. Gels were dried and imaged as described above. Relative 
cleavage intensities were determined by volume integration of individual cleavage bands using 
ImageQuant software (Molecular Dynamics). 

Results and Discussion 

20 

Syntliesis. 

Polyamides were synthesized by solid phase methods as described in Baird, E.E.; 
Dervan, P.B. 1 Am, Chem. Soc, 118, 6141-6146 (1996). 

25 Binding Affinities. 

Quantitative DNase I footprint titration experiments (10 mM Tris»HCl, 10 mM KCl, 10 

mM MgCl2, 5 mM CaCl2, pH 7.0, 24 °C) were carried out on the 282 bp, 3'-''^ end-labeled 
pJT2B2 EcoRI/PvuR restriction fragment which contains the three target sites 5'-TGCTGCA- 
3' (the HIV-1 target sequence), 5'-TGGTGGA-3', and 5'-TGTTACA-3' (Figure 7). These 
30 experiments reveal that the 5-y-5 motif polyamide 1 binds the HIV-1 target sequence 5'- 

7 -1 

TGCTGCA-3' witli a relatively low equilibrium association constant, Ka = 8 x 10 M , while 
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no binding of the 5-y-5 polyamide 2 to its target site 5'-TGGTGGA-3' is observed (Ka < 5 x 
7 -1 

10 M ). In contrast, 5-y-5 polyamide 3 binds its target site 5'-TGTTACA-3' with very high 
affinity (Ka = 5 x 10^^ M"^). (Figure 8, Table 1). 

5 Quantitative DNase I footprintirig experiments reveal that the three 2-p-2-Y-2-p-2 motif 

polyamides 4, 5 and 6 selectively target their respective match sequences 5'-TGCTGCA-3' 
(the HIV-1 target sequence), 5'-TGGTGGA-3', and 5'-TGTTACA-3' with equilibrium 

association constants greater than 1 xlO^ M"^ (Figure 8, Table 1), demonstrating that the 2-p- 
2-y-2-P-2 motif allows specific targeting of a range of 7 base pair sequences v^th 
10 subnanomolar affinity, and is generally usefiil for all 5'-WNNWNNW-3' sequences. 
Polyamide 4 specifically binds the HIV-1 target sequence 5'-TGCTGCA-3' with 

subnanomolar affinity (Ka = 2 x 10^^ M"^), and thus represents a solution to the present 
polyamide design problem. 



Tahle 1. Equilibrium associauon consianis fM ')* 


Polyamide 


5-aTCCTCCAi-3* 


.5--aTGCTCCAi-3' 


S-aTGITACAlO' 


S-y-Smotif: 

1 lmPyPyImPy-v-lmPyPyImPy-(i-Dp 

2 ImlmPylmlm-Y'PyPyPyPyPyP-Dp 

3 ImPyPyPyPy-y-ImPyPyPyPy-p-Dp 


8.3 X 10^(1.5) 
<5 X lO' 
1.2 X 10'(0.4) 


< 1 X id' 
< 5 X 10^ 

< 5 X 10* 


< I X lO' 

< 5 X lO' 

5.1 X 10*** (1.1) 


2-p'2'r'2'{^2 motif: 

4 ImPy.p-ImPy-Y-lmPy-P-ImPy-P-Dp^ 

5 ImIm-P-ImIm-TPyPy-[l-PyPy-P-Dp 

6 ImPy.p.PyPy-Y-ImPy-P-PyPy-P-Dp 


1.5 X lO'"* (0.3) 
1.7 X 10* (0.3) 
< 1 X to'* 


< 5 X lo" 
7.6 X lO' (l.D) 

< I X 10* 


< 5 X 10* 

< 1 X 10* 
2.6 X lO' (1.0) 


Unlinked dime r motif: 

7 ImPyPyPyPy-p-Dp 

8 ImPy.P-lmPy.p.Dp 
!>ImPy-P-PyPyp-Pp 


LI .X 10^0.2) 
3.0 X 10* (0.2) 
< 5 X lo' 


< 1 X lo' 

< 2 X 10^ 

< 5x 10* 


3.9 X i0*.(0.6) 

1.5 X 10^0.3) 
2.3 X 10* (O.I) 



•Values rcponed- are ihc mean values from at tcasi three DNasc I footprini titraiion expcrimcnis. The standard 
deviation for each value is indicated in parentheses. The assays were carried out at 24 "C. pH 7,0. in ihc presence of 10 
m.M Tris'HCl. 10 mM KCI. 10 mM MgCU, and 5 mM CaCl;. Nucleotides flanking polyamide binding sites are in 
lowercase type. Association constants corresponding to formally matched complexes are in bold type, ' 

This polyamide binds the site 5'-lgttacatTCCTCCAc-3* adjacent to the 5*-TCTTACA-3' site with an equilibriurri 
association constant of 2.1 x 10* M ' (0.1). 



Failure of Polyamide 1. 



The failure of polyamide 1 to bind the HIV-1 target sequence with high affmity is 
consistent with previous results. For example, the 4-Y-4 motif polyamides ImPyPyPy-y- 
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ImPyPyPy-P-Dp, ImPyPyPy-y-PyPyPyPy-p-Dp and ImlinPyPy-Y-ImlmPyPy-p-Dp bind 

respective 6 base pair targist sequences with subnanomolar affinity (Ka ^ 10^^ M"^), while 
ImPylniPy-y-ImPylmPy-p-Dp and Imlmlmlm-Y-PyPyPyPy-p-Dp bind respective target 

8 1 

sequences with substantially lower affinity (Ka < 10 M" ). Based on these and additional 
5 results, the common feature of hairpin polyamides that bind with high affinity appears to be the 
placement of imidazole residues only at the first and second positions of polyamide subunits, 
coimting from the N-terminal end, althougli in some cases substitution at the third position is 
tolerated. Imidazoles placed beyond the second position of a polyamide subunit are not 
positioned optimally for specific ligand-DNA contacts. 

10 

The 5-Y-5 polyamides 1, 2, aiid 3 are analogs of 4, 5, and 6, respectively, in which the 
p-alanine residues have been changed to pyrrole residues (Figure 6). Polyamides 1 and 2, 
which have imidazole residues beyond the second position of a polyamide subunit, have 
equilibrium association constants for their match sites more than 100-fold lower than their P- 
15 alanine-substituted analogs 4 and 5, respectively (Table 1). Thus, the 2-p-2-Y-2-p*2 motif is 
essential for high-affinity recognition in these cases. In contrast, polyamide 3 having no 
imidazoles beyond the second position of a polyamide subunit binds its match site with an 
equilibrium association constant -20-fold higher than its p-alanine-substituted analog 6 (Table 
!)• 

20 

The following model is consistent with our data: 1) In the 2-P-2-Y-2-P-2 polyamides 4 
and 5 the flexible p-alanine linkers correctly position the imidazole residues following them for 
specific hydrogen bond contacts with their target guanine bases, while in 5-Y-5 polyamides 4 
and 5 the analogous hydrogen bonds cannot form. 2) In contrast, for both 2-P-2-Y-2-P-2 
25 polyamide 6 and polyamide 5-Y-5 3, the pyrroles beyond the second positions within 
polyamide subunits are correctly positioned, and 3 binds witli lower affinity than 6 due to its 
greater conformational entropy. 

These results indicate that the optimal binding positions, or "binding register," is 
30 different for imidazole and pyrrole residues, consistent with previous results which suggest 
that, for subunits composed entirely of rings (i.e. without P-alanine linkers), imidazole residues 
go out of register after 2-3 residues with substantial drops in affinity, while pyrrole residues go 
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out of register after 5 rings with a gradual leveling of affinity before a substantial drop at 8 
rings. Finally, these results indicate that the 2-p-2-y-2-P-2 and 5-y-5 motifs are complementary 
for recognition of 7 base pair sequences, with the optimal motif dependent on the sequence 
composition of the desired target site. 

5 

Relative EfTects of Covalent Linkage with y-Aminobutyric Add. 

Equilibrium association constants for the **xmlinked dimer motif polyamides 7, 8, and 
9 were measured to allow comparison with their corresponding y-aminobutyric acid-linked 
polyamides 3, 4, and 6. Polyamides 4 and 6, which are composed of 2-y-2 subunits, bind with 
10 -6000- fold and --1 000- fold higher affinity, respectively, relative to their respective unlinked 
analogs 8 and 9. Poly amide 3, which is composed five-ring subimits, binds with ~1 00-fold 
higher affinity than its imlinked analog 7. The significantly larger increase in affinity upon 
linkage with y-aminobutyric acid for 8 and 9 relative to 7 is consistent with the greater 
structural rigidity and consequently greater preorganization of 7. 

15 

Exact Binding Sites and Binding Orientations* 

MPE*Fe^^ footprinting experiments carried out with the foxu: polyamides 4, 5, 6, and 3, 
which have subnanomolar binding affinities, confirm that tliese compounds specifically bind 
their respective seven base pair match sites (Figures 9a and 10a). Affinity cleavage 

20 experiments were performed with the EDTA-polyamides 4-E, 5-E, 6-E, and 3-E to identify the 
location of the C-termini of these polyamides when bound to their target sites (Figures 6, 9b, 
10b). The symmetric polyamides 4-E, 6-E, and 3-E produce cleavage patterns at both sides of 
their binding sites as expected, consistent with two distinct binding orientations (Figure 9c). It 
appears that, in general, side-by-side polyamide-DNA complexes preferentially bind in the 5' 

25 to 3' (N- to C-terminus) orientation. Recent results of studies with hairpin polyamides are 
consistent witli formation of 3' to 5'-oriented polyamide-DNA complexes, and suggest that 
such "reversed orientation" complexes typically have affinities ~1 0-fold lower than analogous 
5' to 3 '-oriented complexes. Consistent with tliis trend, polyamide 4 binds the "reversed 
orientation" match site 5'-TCGTCGA-3' with an affinity -1 0-fold lower than it binds the site 

30 5'-TG.CTGCA-3' (see footnote to Table 1). Inconsistent with this trend, however, the 
assymmctric polyamide 5-E produces a cleavage pattern (Figure 10b) consistent with the 
polyamide binding preferentially in the re\'ersed orientation (3' to 5% N- to C-terminus) as 
illusti*ated (for the parent compound 5) in Figure 10c. 
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Effect of Physiological Salt and Temperature. 

Quantitative DNase I footprinting experiments indicate that increasing the equiHbration 
temperature from 24 to 37 and changing the solution conditions from standard 
5 polyamide assay conditions (10 mM Tris-HCl, 10 mM KCl, 10 mM MgCl2, 5 mM CaCl2, pH 
7.0 at 24 °C) to conditions modeling those encountered within a typical mammaUan cell (140 
mM KCl, 10 mM NaCl, 1 mM MgCl2, 1 mM spermine, pH 7.2) has little effect on polyamide 
binding affinity (Table 2). 



Table 2. 


Equilibrium association constants (M ") of 


polyamide 4 for iis maich site 5' 


-TGCTGCA-3*.' 


Buffer* 


Temp. 


5--aTGCTCCAa-3- 


A 


24 'C 


1.5 X 10'^(0.3) 




317 »C 


8.4 X I0'(2.3i) 


B 


24 "C 


1.9 X 10'°(0.6) 




37 "C 


1.1 X 10'°(0.2) 



"Values reported are the mean values from at least three 
DNase I footprint titraiiion experiments. The stamdard 
deviation for each value is indicated in parentheses. 



"Buffer A:. 10 mM Tris*HCl. 10 mM KCl, 10 mM 
MgCl,, and 5 mM CaCl,. pH 7.0 at 24 ^C; Buffer B: 10 
mM HEPES-HCl, 140 mM KCl, 10 mM NaCK 1 mM 
MgCl.. I mM spermine. pH 7.2. 

10 

Example 3: 

Binding of Polyamides to HIV-1 Promoter Sequences 

15 Sequence-specific DNA-binding small molecules that can permeate hxmian cells could 

potentially regulate transcription of specific genes. Pyrrole-imidazole polyamides were 
designed to bind DNA sequences proximal to binding sites for the cellular transcription factors 
TBP and LEF-1 utilized by HIV-1 for RNA synthesis. These synthetic ligands inhibit HIV-1 
transcription in cell-free assays and virus replication in isolated human peripheral blood 

20 lymphocytes. The ability of small molecules to target predetermined DNA sequences located 
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within RNA polymerase II promoters suggests a general approach for regulation of gene 
expression, as well as a mechanism for the inhibition of viral rephcation. 

In Example 1, above, it was demonstrated that an eight-ring hairpin polyamide targeted 
5 to a specific region of the transcription factor TFIIIA binding site inhibits 5S RNA gene 
transcription by RNA polymerase III in Xeiwpus kidney cells. See also Gottesfeld, J.M., et al., 
Nature 387, 202 (1997). Using a similar approach, it has been found that polyamides can 
inhibit HIV-1 transcription in cell-free assays and viral replication in himian lymphocytes. 

10 The HIV-1 enhancer / promoter element contains binding sites for the cellular 

transcription factors Ets-1, LEF-1, NF-kB and SPl along with a canonical TBP binding site 
(TATA element) and an initiator sequence (Figure 11, taken from K. A. Jones & B. M. Peterlin 
Annu, Rev, Biochem, 63, 717 (1994)). Jones & Peterlin (1994) describe the composition of the 
HIV-1 enhancer region sequence. Binding sites for such transcription factors are not optimal 

15 polyamide target sequences because tliey are found in the promoters of many protein-coding 
genes. However, the sequences immediately flanking these transcription factor binding sites 
vary from gene to gene. In some cases, transcription factor flanking sequences are conserved 
for a particulai* gene providing an address for gene-specific targeting of common transcription 
factors. For example, the sequences immediately upstream and downstream from the HTV-l 

20 TATA element are of the form 5*-WGCWGCW-3' (where W = A or T), At least one copy of 
the 5'-WGCWGCW-3' sequence is located adjacent to the HIV-1 TATA element in all 
sequences of the HIV-1 LTR found in the EMBL and Genbank data bases K. Freeh, R. Brack- 
Werner, T. Werner describe the common modulai* structure of viral LTR's. Virology 224, 256 
(1996) However, the 5'-TGCTGCATATAAGCAGCT-3* TATA element is found only in 

25 strains and isolates of HIV-1 and in no other sequence in either data base. The activity of the 
HTV-l promoter, and especially Tat-induced activity, is critically dependent upon this 
sequence. For example, point mutations in the TATA flanking region reduces activated 
transcription by 10-fold without effecting basal transcription. Tat-transactivation is essential 
for HIV-1 RNA synthesis and vims replication. B. Berkhout, K.-T. Jeang describe TAT 

30 induced expression of the HIV-1 long tcmiinal repeats. Virol 66, 139 (1992); H. S. Olsen, C. 
A. Rosen, describe contribution of tlie TATA motif to the TAT-mediated transcriptional 
activation of HIV-1 gene expression. J. Virol, 66, 5594 (1992). M. R. Sadaie, T. Benter, F. 
Wong-Staal. describe site directed mutagenesis of the 2-trans regulatory genes of HTV-l. 
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Science 239, 910 (1988). Although the possibility of mutational change cannot be precluded, it 
would expected that targeting the HIV-1 TATA flanking sequence with a polyamide might 
prove effective for inhibition of HIV- 1 transcription and virus replication. A 5'-WGCWGCW- 
3' sequence which is not conserved in all strains of HIV-1 is located adjacent to the Ets-1 
binding site. An additional search identified a 5'-WGWWCW-3' sequence adjacent to the 
binding site for LEF-1 which was conserved for most reported strains of HIV-1. In order to 
regulate pol n transcription of only HIV-1, polyamides were designed to bind inmiediately 
adjacent to rather than directly to the binding sites for TBP and LEF-1 (Figure 1 1). 

DNase footprint titrations showed that polyamide 1 bound to the site adjacent to the 
LEF-1 binding site with an apparent dissociation constant of 0.1 nM (Figure 12, site Pl-l). The 
labeled DNA was incubated with the following concentrations of polyamide 1 for 45 min prior 
to the addition of LEF-1 DBD to a final concentration of 8 nM (in the reactions of lanes 7 to 
14); no polyamide (lanes I and 7); 10 pM (lanes 2 and 8); 30 pM (lane 9); 0. 1 nM (lanes 3 and 
10); 0.3 nM (lane 1 1); 1 nM (lanes 4 and 12); 3 nM (lanes 5 and 13); 10 nM (lanes 6 and 14). 
After a second incubation period of 45 min, the samples were subjected to DNase I digestion 
and analyzed by gel electrophoresis. The location of polyamide sites Pl-l to Pl-4, LEF-1 sites 
L1-L3, the LEF-1 DNase hypersensitive (HSS) site and the start-site for transcription (+1) are 
shown along the side the autoradiograni. Three additional potential binding sites for polyamide 
1 are present in the HIV-1 promoter (5'-TGTACT-3') and coding sequence (5'-AGATCT-3*) 
and polyamide 1 also binds these sequences (Pl-2 to Pl-4, Figure 12). In agreement with 
previous published studies, recombinant LEF-1 protein (recombinant DNA binding domam) 
bound to its target site with an apparent dissociation constant of 1.5 nM (as determined by 
DNase footprint titrations). Two additional LEF-1 sites are found in the HIV-1 promoter and 
coding sequences (sites L2 and L3, Figure 12). Importantly, polyamide 1 proved to be an 
effective inhibitor of LEF-1 binding to each of these sites with a Ki of 0.1 nM for the enhancer 
site. Importantly, inhibition was observed either when the polyamide was incubated with the 
DNA before adding LEF-1 or after preincubation of the DNA with LEF-1. In similar DNase 
footprinting experiments, tlie mismatch polyamides did not inliibit LEF-1 binding even at a 
100- fold higher concentration that that needed for inhibition by polyamide 1. 
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Based on the results described above, it was expected tliat polyamide inhibition of LEF- 
1 binding would inhibit LEF-1 -dependent transcription in an appropriate in vitro assay system. 
The effects of polyamide 1 and mismatch polyamide 2 on HIV-1 ti-anscription were tested in an 
in vitro system consisting of a cell-fiee extract prepared from cultured human lymphoid H9 

5 cells. This whole-cell extract contains high levels of LEF-1 protein but supports only low 
levels of transcription, suggesting a limitation for other transcription components in this 
extract. The H9 cell extract was supplemented with the HeLa cell-derived nuclear extract in 
order to obtain high levels of transcription. The H9 cell extract stimulates HIV-1 transcription 
-2.5 to 3-fold over tlie level of transcription observed with the HeLa extract alone and 

10 immunodepletion of LEF-1 protein from the H9 extract abolishes this activated transcription. 
Polyamide 1 is an effective inhibitor of HIV-1 transcription in this system: 50% inhibition of 
transcription is obtained in polyamide titration experiments between 10 and 30 nM polyamide 
1 in the reaction. Polyamide 1 fails to inhibit HIV-1 transcription in the LEF-1 -depleted 
extract or with the HeLa extract alone. Similarly, LEF-1 depletion had no effect on CMV 

15 MIEP transcription. The effects of the mismatch polyamide 2 on HIV-1 and CMV 
transcription were tested as additional controls. No potential binding sites for either polyamide 

I or 2 are present in the CMV MIEP sequence. As expected, polyamide 1 fails to inhibit CMV 
transcription and mismatch polyamide 2 fails to inhibit either HIV-1 or CMV transcription. 

Notably, HIV-1 transcription was not inhibited by polyamide 1 at concentrations as 
higli as 1 iiM even thougli binding sites for the polyamide occuiTed near the start-site for 
transcription (-2 to -7) and within tlie transcribed sequence (+20 to +25). A polyamide was 
designed and synthesized in inhibition studies with the human cytomegalovirus (CMV) 
immediate early piomoter. A potential binding site for polyamide 3 (Figure 1 above) is located 
near the start-site for CMV transcription. Similar to our results with polyamide 1 and HIV-1 
transcription, polyamide 3 had no effect on CMV transcription in our reconstituted assay 
system. These results with both the HIV-1 and CMV promoters suggest that RNA polymerase 

II can initiate transcription and read tlirough DNA bound witli a polyamide in the minor 
groove. Thus, a polyamide must interfere with an essential DNA-binding protein to be an 
inhibitor of transcription. 



25 
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Another well-studied minor groove DNA-binding protein is the TATA-box binding 
protein [TBP]. X-ray crystallographic studies of the TBP-DNA complex have clearly shown 
that TBP binds DNA in tlie minor groove and that binding results in a severe distortion of the 
DNA duplex (unwinding and bending). 

5 

Since TATA elements are found in many protein-coding genes (with the exception of 
the so-called "housekeeping" genes), the TATA element itself is not a good candidate for a 
polyaniide target sequence. However, the sequences immediately flanking TATA boxes are 
unique to each individual gene with little or no homology between genes. Thus, by analogy 
10 with our LEF-i results, it would be expected that targeting the sequence inmiediately adjacent 
to a TATA element witli a polyamide might inhibit TBP binding and thus inhibit transcription 
of that gene by interfering with assembly of the RNA polymerase 11 transcription complex. 
Note that it is the binding of the TBP subunit of TFUD that nucleates the assembly of the RNA 
polymerase II transcription machinery for TATA-containing genes. 

15 

As a model system the polyamide 1 binding site was introduced immediately adjacent 
to the HIV-1 TATA element by oligonucleotide-directed mutagenesis. Thus the TATA region 
sequence was changed from 5'-CATATAAGCAGCT-3* (WILD-TYPE) to 5*- 
CATATAAGTACrr-S* (MUTANT, polyamide 1 site underUned), 

20 

The effect of the polyamide on the formation and stabiUty of the TBP-DNA complex 
was deteraiined using a gel mobility shift assay. As observed in previous gel mobility shift 
assays with recombinant TBP, both monomer and dimer TBP-DNA complexes are observed 
with both the wild-type and mutant TATA-box DNA fragments. Polyamide 1 inhibited TBP 

25 binding to the "mutant" HIV-1 promoter DNA probe but did not significantly inhibit TBP 
binding to the wild-type HFV-l promoter probe (Figure 13 A). Similarly, polyamide 1 inhibited 
basal transcription from the "mutant" HIV-1 promoter but not from the wild-type promoter in a 
reconstituted transcription system containing a HeLa cell nuclear extract (Figure 13B, below). 
Approximately 70% inhibition of basal RNA polymerase Il-mediated transcription was 

30 observed in this assay with polyamide 1 at a concentration of 100 nM. Additionally, neither of 
the niismatch polyamides inhibited TBP binding or transcription. 
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Based on these results, another polyamide was designed and synthesized to specifically 
target the HIV- 1 TATA element as described in Example 2 (Figure 14, below): As shown 
below, tliis polyamide (rmPy-P-ImPy-y-ImPy-P-ImPy-P-Dp, called HIV-1 in Figs. 14-116 and 
polyamide 1 in Figs. 18-24) was designed to recognize the sequences immediately 5' and 3' to 

5 the HIV-1 TATA element. The recognition sites for polyamide HIV-1 are of the form 5 - 
WGCWGCW-3" (where W = A or T). Since a pyrrole-pyrrole pair can recognize either an A-T 
base pair or a T -A base pair, polyamide HIV-l will recognize both the 5' and 3' flanking 
sequences of the HIV-1 TATA element. Binding of the polyamide was confirmed by DNase I 
footprinting and an apparent dissociation constant of 0.05 nM (Ka = 2 X 10^*^ M"') was 

10 determined for both sites (Figure 14). This polyamide inhibits HIV-1 transcription in an in 
vitro tianscription assay with the lymphoid cell nuclear extract and a wild-type HIV-1 
promoter but not transcription of a contiol template lacking the binding site (the CMV 
immediate early promoter). A quantitative representation of the data is shown in Figure 15. 
Fifty percent inhibition of HIV-1 transcription is observed at approximately 50 nM polyamide 

15 in the reaction. This concentration corresponds to only a 16-fold excess of polyamide over 
specific binding sites in the HIV-l plasmid DNA (3 nM). 



A corresponding control polyamide was also syntliesized (Imlm-P-Imlm-y-PyPy-p- 
PyPy-P-Dp, named HIV'2 in Figure 16 and polyamide 2 in Figures 18-24). This polyamide 

20 differs from the HIV-1 polyamide only in the placement of the imidazole and pyrrole amino 
acids and recognizes the TATA box region of the HFV-l LTR with at least 100-fold reduced 
affinity relative to the HIV-1 polyamide (K^ = 2 X 10 ^ M''). In Figure 16, the structures of 
polyamides HIV-1, hnPy-P-ImPy-7-ImPy-p-ImPy-p-Pp and HIV-2, Imlm-p-Imlm-y-PyPy-p- 
PyPy-P-Dp, are shown along with polyamide binding models for the HTV-l TATA box region. 

25 The filled and unfilled circles represent imidazole and pyrrole rings, respectively, the curved 
line represents y-aminobutyric acid, and the diamond represents P-alanine. Single hydrogen 
bond mismatches are highlighted. 

As notetl above, the sequences immediately flanking TATA boxes are unique to each 
30 individual gene with little or no homology between genes. The sequences immediately 5' and 
3* to the HIV-1 TATA element are of the form 5'-WGCWGCW-3' (where W == A or T). A data 
base search revealed that at least one copy of the 5*-WGCWGCW-3* sequence is located 
adjacent to the i llV-l TATA element in all sequences of the HIV-1 LTR found in the EMBL 



r 
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and GeiiBank data bases. Furthennore the 5'-TGCTGCATATAAGCAGCT-3* TATA element 
and the other prevalent HIV-1 TATA element (5'-TGCTGCATAAAAGCAGCC-3') are found 
only in the various strains and isolates of HIV-1 and in no other reported sequence in either 
GenBank or the EMBL data base. 

5 

These polyamides were next used in DNase I footprinting experiments and gel mobility 
shift assays with TBP and an HIV-1 LTR restriction fragment or TATA box oUgonucleotides 
(as described above). As expected, the match polyamide HIV'l inhibited TBP binding to the 
HIV-1 TATA element but the mismatch polyamide HIV'2 did not inhibit TBP binding to this 

10 same DNA fragment (Figure 19 A). DNase I footprint titrations of polyamides HIV-1 and -2 
were preformed in the presence or absence of recombinant human TBP (rhTBP) (- or + TBP). 
A radiolabeled HIV-l LTR restriction fragment was incubated for 30 min with the following 
concentrations of polyamides prior to addition of rhTBP where indicated: no polyamide, lanes 
2, 6, 10, 14; 2.5 nM polyamide, lanes 3, 7, 11, 15; 5 nM, lanes 4, 8, 12, 17; 10 nM, lanes 5, 9, 

15 13, 17. After an additional 30 min incubation, samples were digested with DNase I. G+A 
chemical sequencing reactions are shown in lanes 1 and 18. The extent of the footprints 
generated by polyamide HIV-1 and TBP, respectively, are indicated at the sides of the 
autoradiogram. 

20 Complete inhibition of TBP binding was observed at a polyamide concentration of 2.5 

nM, which represents only a five-fold excess of polyamide over DNA binding sites in this 
assay. The HIV-l polyamide did not inhibit TBP binding to an oligonucleotide corresponding 
to the TATA box region of the adenovirus major late promoter (data not shown). The half-life 
of the HIV'l polyamide was determined in an experiment in which the polyamide-DNA 

25 complex was first formed and then challenged with a large molar excess of unlabeled DNA. 
Samples were taken for digestion with DNase after various incubation times and the apparent 
half-life of tlie HIV-1 polyamide-DNA complex is in excess of 2.5 hours. 

Figure 17 illustrates the binding of several classes of polyamides to the DNA sequence 
30 adjacent to the HIV-1 TATA box (Figure 17, sections la-lc) and adjacent to the Ets-1 / LEF-1 
binding sites. As described above, the polyamide HIV-1 (Figure 17, la) is a hairpin molecule 
that binds to a seven base pair sequence of the form 5*-WGCWGCW-3* (where W = A or T). 
A longer Kvelve base pair sequence can be bound by an antiparallel dimer of the non-hairpin 
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polyaniide ImPy-p-PyPyPyPy-P-ImPy-p-Dp (Figure 17, lb). An even longer sixteen base pair 
sequence can be bound by an antiparallel dimer of the hairpin polyamide ImPy-p-ImPy-p- 
PyPyPyPy-P-IniPy-p-IniPy-P-Dp (Figure 17, Ic). Similarly, a six base pair sequence between 
the Ets-1 binding site and the LEF-1 binding site can be bound by the polyamide ImPyPyPy-y- 
5 ImPyPyPy-P-Dp (Figure 17, 2a). The same sequence plus a portion of the LEF-1 binding site 
extending a total of nine base pairs can be bound by the polyamide ImPy-p-ImPyPyPy-y- 
ImPyPyPy-P-PyPy-p-Dp (Figure 17, 2b). 

Example 4: 

10 Inhibition of TBP Binding by Polyamide Ligands 

Figure 18 schematically depicts the HIV- 1 promoter and DNA-binding sites for 
polyamide hairpins designed to target the HIV-1 promoter. In Figure 18A, the DNA binding 
sites for hairpin polyamides designed to target the HIV-1 enhancer and promoter are illustrated, 

15 showing the binding of the polyamide ImPyPyPy-y-ImPyPyPy-p-Dp, here designated 
polyamide 3, also labeled above as polyamide 1 in Figures 1-4 and 12-13. As before, in the 
binding model illustration, the shaded and unshaded circles represent imidazole (Im) and 
pyrrole (Py) rings, respectively, the curved hnes represent y-aminobutyric acid (y), and the 
diamonds represent P-alanine (P). The shaded bar schematic of the HIV-l enhancer and 

20 promoter shows nucleotide positions -170 to the transcription start site at +1. The binding sites 
for the transcription factors USF, Ets-1, LEF-1, NF-kB, SPl and TFIID (TBP) are indicated. 
The binding sites for polyamides 1, called HTV-I above, and 3, called HIV-2 above, are 
indicated. The structures of polyamides ImPy-P-ImPy-y-IniPy-P-ImPy-p-Dp (1), Imlm-p- 
Xmlm-y-PyPy-p-PyPy-P-Dp (2), ImPyPyPy-y-ImPyPyPy-P-Dp (3), and ImPyPyPy-y- 

25 PyPyPyPy-P-Dp (4), are shoAvn in Figure 18 B, where Dp = dimethyaminopropylamide. 
Binding models and measured dissociation constants (IQ/) are shown. Mismatched base pairs 
arehighliglited. 

Ii is known that binding of the TBP subunit of TFIID in the minor groove nucleates 
30 assembly of the RNA polymerase II transcription machinery for TATA-containing genes. In 
general, S. K, Burley, R. G. Roeder describe the biochemistry and structural biology of the 
transcription factor TFIID Ann. Rev. Biochem. 65, 769 (1996); G. P. Verrijzer, R. Tjian 
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describe TAPS which mediate transcription activity and promoter selectivity TIBS 21, 338 
(1996); R. G. Reader describes the role of general initiation factors in transcription by RNA 
polymerase II 7755., 21, 327 (1996). 

5 According to the pairing rules for DNA recognition, the 5 -WGCWGCW-3* sequence 

adjacent to the HIV-1 TATA element is targeted by hairpin polyamide 1 having sequence 
composition IniPy-P-ImPy-y-ImPy-p-ImPy-p-Dp (Figure 18). The imidazole-pyrrole pair 
(Im/Py) targets a G*C base pair and Py/Im targets C*G. Since the p/p pairing recognizes both 
A*T and T*A base pairs polyamide 1 is expected to bind both the 5' and 3* flanking sequences 

10 of the HIV-1 TATA element (Figure 18A). White, S., Baird, E. E. & Dervan, P. B. describes 
the pairing rules for recognition in the minor groove of DNA by pyrrole-imidazole polyamides. 
Chem, & Biol 4, 569o78 (1997); Swalley, S.' E., Baird, E. E. & Dervan, P. B. describe the 
sequence specificity of the p/p pairing Chem Eur, J, 3, 1600 (1997), Quantitative footprint 
titration experiments re\*eal that polyamide 1 binds both TBP sites as well as a 5'-AGCTGCA- 

15 3' match site overlapping the binding site for Ets-1 with an equilibrium dissociation constant 
{K^ of 0.05 iiM (Figure 19A, lanes 3-5). For DNase I footprinting a 400-bp restriction 
fragment containing the HIV-1 enhancer and promoter regions was isolated from pHIV LTR- 
CAT plasmid DNA (obtained from Dr. K. A. Jones of The Salk Institute, La Jolla, CA) (P.L. 
Sheridan et ai, describe the sequence composition of the plasmid pHIV LTR-CAT Genes Dev. 

20 9, 2090 (1995)). Labeled DNA was incubated with polyamide (45 minutes) and digested with 
DNase 1 under single-hit conditions. 

Regions of polyamide protection were detemiined by analysis of the digestion products 
on a 6% denaturing polyacrylamide sequencing gel. Storage phosphorimage analysis of the 

25 data was used to obtain dissociation constants for the binding reactions (using Kodak Storage 
Phosphor Screens and a Molecular Dynamics SF Phosphorhnager). Recombinant himian TBP 
(rhTBP) was obtained from Promega and was used in footprinting and gel mobiUty shift 
experiments as recommended. Tlie reaction mixtures also contained 100 ng of poly dG-poly 
dC per 10 ^iL gel shift reaction or 50 ^L footprinting reaction. For gel mobihty shift assays, 

30 6% polyacrylamide gels (20:1, acrylamide lo bisacrylamide) contained 44 mM Tris-borate, pH 
8.3, 1 niM EDTA, 4 niM MgCl2 and 0.02% (v/v) NP-40. The same buffer was used for 

electrophoresis of 1 mm tliick gels at 1 5 volls/cm for 2 hours at 4 *^C. 
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The HIV-1 TATA region double-stranded oligonucleotide, used as probe for the gel 
shift assay, had the top-strand sequence: 5'- 

CGTCCCTCAGATGCTGCATATAAGCAGCrGCTTTTTGCCTGTACTGGGTC-3', and the 
complementary bottom-strand sequence. Tlie adenovirus major late promoter TATA region 
5 oligonucleotide has the top-strand sequence 5'-GATCGGGGGCTATAAAAGGGGGTGGG- 
3', and the complementary bottom-strand sequence. Each DNA strand (50 ng) was labeled in 
-.32 

separate reactions with y- P-ATP and T4 polynucleotide kinase and the double-stranded 
oligonucleotides were prepared by annealing equal molar amounts of the two complementary 
strands. Gel shift reactions contained 15 fmol of double stranded oligonucleotide (1.5 nM) 
10 and, where indicated in the figure legends, 50 pg of rhTBP (0.14 nM). Footprinting reactions 
contained the HIV-1 DNA probe (0.5 nM), and polyamide and 5 ng of rliTBP (3 nM) as 
indicated. 

The inhibition of TBP binding to the HIV-1 TATA element by polyamide 1 is shown in 
Figure 19. Figure 19A shows DNase I footprint titrations of polyamides 1 and 2 in the 
presence or absence of rhTBP. The radiolabeled HIV-1 LTR restriction firagment was 
incubated for 30 min witli the following concentrations of polyamide 1 or 2 prior to addition of 
rhTBP where indicated (+): no polyamide, lanes 2, 6, 10, 14; 2.5 nM polyamide, lanes 3, 7, 11, 
15; 5 nM, lanes 4, 8, 12, 16; 10 nM, lanes 5, 9, 13, 17. After an additional 30 min incubation, 
samples were digested with DNase I. G+A chemical sequencing reactions are shown in lanes 1 
and 18. The location of the transcription start site, and tlie extent of the footprints generated by 
polyamide 1 and TBP (+1), are indicated at the sides of the autoradiogram. Binding sites for 
polyamide 1 ai'e located adjacent to the TATA element and additional sites are found 
overlapping the bindmg site for the transcription factor Ets-1 (Figui-e 18A) and in the vector 
DNA sequence (upstream of position -170). 

A mismatch control polyamide Imlm-P-Imlm-y-PyPy-p-PyPy-p-Dp (2) which differs 
only in ihe placement of imidazole and pyn ole amino acids binds the HIV-1 TATA box region 
with 100-fold reduced affinity relative to polyamide 1. TBP binds the HFV-l TATA element 
30 with a Kd of -1-3 nM. DNase footprinting assays reveal that polyamide 1 inhibits TBP 
binding at a concentration of 2.5 nM, which represents only a five-fold excess of polyamide 
over DNA binding sites in this assay (Figure 19 A., lane 7). Mismatch polyamide 2 fails to 
inhibit TBP binding (Figure 19 A., lanes 15-17). 
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A gel mobility shift assay which disiinguishes the TBP-DNA complex from free DNA 
and polyamide-DNA complexes was carried out. The results are illustrated in Figure 19B. 

5 Radiolabeled HIV-1 TATA box (lanes 1-5 and 11-15) or adenovirus major late promoter 
TATA box duplex oligonucleotides (lanes 6-10) (11) were incubated with the following 
concentrations of polyamide prior to addition of rhTBP, where indicated (+): no polyamide, 
lanes 1, 2, 6, 7, 11, 12; 50 nM polyamide, lanes 3, 8, 13; 100 nM, lanes 4, 9, 14; 200 nM, lanes 
5, 10, 15. After an additional 30 min incubation, samples were subjected to electrophoresis. 

10 The positions of free DNA probes (F) and the monomer and dimer TBP-DNA complexes (B) 
are indicated alongside the figure. Polyamide 1 inhibits TBP binding to a double stranded 
oligonucleotide corresponding to the HIV-1 TATA box region, while no inhibition is observed 
for control polyamide 2 (Figure 13B). Additionally, polyamide 1 does not inhibit TBP binding 
to the TATA box region of the adenovirus major late (AdML) promoter (5'- 

15 GGGGGCTATAAAAGGGGGT-3') which contains mismatch (underlined) flanking 
sequences. The half-life of the polyamide 1-DNA complex was determined by competition 
experiments to be in excess of 2.5 hours. 

20 Example 5: 

Inhibition of LEF-1 binding. 

Folyamides were designed and syntliesized to target a sequence adjacent to the binding 
site for LEF-1, a cellular transcriptional activator used by HIV-1. LEF-1 is a member of the 
HMG family of minor-gi^ove binding proteins. J. J. Love, et al. describe the structure of the 

25 LEF-l-DNA complex. Nature 376, 791 (1995); J. Kim, F. Gonzales-Scarano, S. Zeichner, J. 
Alwine, Describe replication of HTV-l containing Unker substitution mutations in the -201 to - 
130 region of tlie long terminal repeat J. Virol 67, 1658 (1993);.In addition to acting as an 
architectural transcription factor, LEF-1 possesses a strong ^ra/w-activation domain and has 
been shown to be essential for viral transcription and replication in lymphoid cells. The LEF-1 

30 binding site is immediately flanked on one side by the sequence 5'-AGTACT-3'. According to 
the pol^'amide pairing rules, this sequence should be bound by hairpin polyamide 3 of sequence 
composition ImPyPyPy-y-lmPyPyPy-p-Dp which has ahready been characterized. Trauger, 
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J.W., Baird, E. E. Dervan, P.B. describes the recognition of DNA by designed ligands at 
subnanomolar concentrations. Nature 382, 559-561 (1996). 

Quantitative footprint titration experiments reveal that polyamide 3 binds the 5- 
AGTACT-3' site adjacent to the LEF-1 binding site with Kj of about 0.06 nM. (site P3-1 in 
Figure 20A). DNase I footprint titration experiments were performed as described for TBP 
above. Tliree additional polyamide 3 binding sites present in the HIV-1 enhancer/promoter 
restriction jfragment were bound with Kj about 1 nM: 5*-TGTACT-3' (located at -7 to -2, P3-3 
in Figure 20A); 5*-AGATCT-3' (located at +20 to +25, P3-4 in Figure 20A) and a single base 
mismatch site 5'-TCTACA-3' (located at -106 to -111, P3-2 in Figure 20A). DNase I 
footprinting failed to detect any high affinity sites for mismatch polyamide 4 on the HIV-1 
enhancer/promoter fragment (Figure 20B). Three sites on the HIV-1 promoter/enhancer 
restriction fragment are bound by the 86-amino acid LEF-1 DNA binding domain (DBD), sites 
LI, L2, andL3 (Figure 20A) with IQ/= 1.4, 5.8, and 4.9 nM, respectively. Recombinant LEF-1 
protein containing the 86 amino acid DNA-binding domain (DBD) was the generous gift of J. 
Love (Scripps Research Institute) and was expressed and purified as described. J. J. Love, et al. 
describe the structure of the LEF-l-DNA complex. Nature 376, 791 (1995). LEF-1 was diluted 
into a buffer containing 10 mM Hepes-OH, pH 7.5, 100 mM KCl, 1 mM dithiothreitol, 1 mM 
MgCl2, 10% (v/v) glycerol and 250 ng/mL BSA. 

DNA binding reactions were performed in tlie same buffer (without BSA) and also 
contained 25.0 ng of poly dO-poly dC and the HIV-1 DNA probe at 50 pM in a final volume of 
50 nl. For assessing LEF-1 occupancy by DNase fiDotprinting, tlie intensity of a protein- 
induced DNase I hypersensitive site was used as a halhnark of LEF-1 binding. The LEF-1 
footprint at site LI, cliaracterized by a marked DNase I hypersensitive site (HSS), clearly 
changes to the polyamide footprint in the presence of match polyamide 3. LEF-1 binding is 
inhibited 50% at a polyamide 3 concentration of approximately 60 pM. Thus, polyamide 3 
located in the minor groove immediately adjacent to site LI inhibits LEF-1 DBD binding to 
this site. Polyamide 3 only inhibits LEF-1 binding to sites L2 and L3 at markedly higher 
polyamide concentrations (3 nM and above, lanes 13-14). Lihibition was observed either by 
adding the polyamide to ihc DNA before LEF-1 or after preincubation of the DNA with LEF-1, 
consistent with the relati\ e dissociation constants for the two binding reactions. Mismatch 
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polyaniide 4, which binds to the 5'-AGTACT-3' sequence with >100-fold reduced affinity, 
fails to inhibit LEF-1 binding (Figure 20B & C). 

The inhibition of LEF-1 binding to the HIV-1 enhancer by polyamide 3 is shown in 

5 Figure 20. In Figure 2()A, the 3'-^^P end-labeled HIV-1 LTR restriction fragment was 
incubated with 1 0 pM - 1 0 nM polyamide 1 for 45 min prior to the addition of LEF-1 DBD to a 
final concentration of 8 nM (lanes 7 to 14). After an additional 45 min, the samples were 
subjected to DNase I digestion and analyzed by gel electrophoresis. The polyamide 
concentrations were: no polyamide (lanes 1 and 7); 10 pM (lanes 2 and 8); 30 pM (lane 9); 0.1 

10 nM (lanes 3 and 10); 0.3 nM (lane 11); 1 nM (lanes 4 and 12); 3 nM (lanes 5 and 13); 10 nM 
(lanes 6 and 14). In Figure 20 B, mismatch polyamide 4 does not bind to the the HIV-1 
enhancer/promoter. The labeled DNA was incubated with the following concentrations of 
polyamides for 45 min: no polyamide (lane 1), 10 nM polyamide 3 (lane 2), 1, 3, 10, and 60 
nM polyamide 4 (lanes 3-6). In Figure 20C, mismatch polyamide 4 does not inhibit LEF-1 

15 binding to the HIV-1 enhancer/promoter. The labeled DNA was incubated with the following 
concentrations of polyamide 4 for 45 min prior to the addition of LEF-1 DBD: 1 nM (lanes 1); 
3 nM (lanes 2); 10 nM (lanes 3); and, 30 nM (lanes 4). LEF-1 DBD was added to a final 
concentiation of 8 nM and, after an additional 45 min incubation, the samples were subjected 
to digestion with DNase I. The location of polyamide sites P3-1 to P3-4, LEF-1 sites LI to L3, 

20 the LEF-1 HSS site and the start-site for transcription (+1) are shown alongside the 
autoradiogram. 



Example 6: 

25 inhibition of RN A Polymerase II Transcription 

by Synthetic DNA-Binding Ligands 

The effects of polyamides 1 and 2 on HIV-1 transcription were tested in an in vitro 
transcription assay with a HeLa cell nuclear extract (Figure 21A). The effects of the 
30 polyamides on basal transcription by RNA polymerase II was monitored with a nuclear extract 
prepared from HeLa cells (J. D. Dignani, P. L. Martin, B. S. Shastry, R. G. Roeder, Describe 
eukaryotic gene transcription with purified components Meth. EnzymoL 101, 582 (1983)). The 
human lymphoid cell line H9 (ATCC HTB 176) was grown in suspension culture in RPMI 
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mediuni (Bio Whittaker) supplemented with 10% fetal calf serum (Tissue Culture Biological). 
Whole cell ext acts of H9 cells in log phase growth were prepared by hypotonic lysis and 
contained 20 mg/ml protein, as determined by the Bradford reaction. Run-off RNA transcripts 
of --300 bases (CMV MIEP linked to a guanine-less cassette, plasmid pElB-GL and --500 
5 bases (pHIV LTR-CAT) were obtained with EcoRI-digested plasmid DNA (100 ng per 25 
reaction). 

Polyamide-DNA complexes were allowed to form at ambient temperature for 30 min 
prior to addition of the nucleai* extract. Transcription complexes were allowed to form for 1 h 

10 at 30 °C prior to a transcription step for 1 h at 30 °C with 10 \iCi of a--^^P-ATP, 10 \iM 
unlabeled ATP, and 600 \xM of the remaining imlabeled nucleoside triphosphates. RNA was 
purified b\' extraction with RNAzol (TelTest, Friendswood, TX) and analyzed by 
electrophoresis on a denaturing (8.3 M lu-ea) 6% polyacrylamide gel. Autoradiograms were 
obtained by exposure of the dried gel to Kodak BioMax fihn with DuPont Cronex Ligjitning 

15 Plus intensifying screens for 1 to 1 8 h at -80^C. Relative levels of transcription were estimated 
by storage phosphorimage analysis, 

Polyamide 1, which taigets the HIV-1 TATA box, inliibits basal transcription from the 
HIV-1 promoter mediated by the general transcription factors SPl and RNA polymerase 11 

20 (Figure 21 A, lanes 2-4). HIV-1 transcription was inhibited 50% in the presence of between 50 
and 100 nM polyamide 1 in several independent experiments. Polyamide 1 does not inhibit 
transcription from the CMV major intermediate early promoter (MIEP), which contains a 
mismatched TATA-flanlcing sequence (5'-GAGGTCTATATAAGCAGA-30. The mismatch 
polyamide 2 does not inhibit ti-anscription from either promoter (Figure 21 A). Titrations were 

25 perfomied of polyamide 1 over a wide range of concentrations (1 to 200 nM) with both the 
HIV-1 and CMV templates in the same reaction. Under these conditions, 50% inhibition of 
HIV-1 transcription was observed at 30 nM polyamide 1, which corresponds to a 3-5 fold 
excess of polyamide over binding sites. No inhibition of CMV transcription was observed even 
at the highest concentrations of polyamide 1 (Figure 21B). Polyamide 1, which has an 

30 additional binding site in the HlV-1 enhancer at nucleotide positions -153 to -147 (overlapping 
the Ets-1 site at -149 to -142), inhibits the DNA binding activity of the Ets-1 transcription 
factor. 
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Tlie effects of polyamides 3 and 4 on HIV-1 transcription in an in vitro system 
consisting of a cell-free extract prepared from cultured human lymphoid H9 cells supplemented 
with HeLa cell extract were determined. The H9 extract contains high levels of LEF-1 protein 
but was found to support only low levels of transcription, suggesting a Umited amount of other 
5 transcription components in this extract. The H9 cell exti-act was supplemented with small 
amounts of a HeLa cell-derived nuclear extract in order to obtain high levels of transcription 
(Figure 21C, lane 1). The H9 HeLa cell extract stimulates HIV-1 transcription 2.5-3-fold over 
the level of transcription observed with the HeLa extract alone. Inimunodepletion of LEF-1 
protein from the H9 extract abolishes tliis activated transcription (Figure 21C, compare lanes 1 
10 and 6). 

For ininnmodepletion and western blot analysis the H9 cell extract was depleted of 
LEF-1 protein with antibody to LEF-1 pre-bound to protein A sepharose beads. A mixture of 5 
jil of antiserum and 50 |.tl of a 1:1 (v/v) slurry of protein A sepharose in transcription buffer 
15 supplemented with 2.5 mg/ml bovine serum albumin (BSA) was incubated on a rotator at 4 ^^C 
for 1 hour. For mock inimunodepletion, an equivalent volume of beads was treated identically 
without antibody. Unbound antibody and BSA were removed by brief centrifligation and the 
beads were washed three times with 50 |il of transcription buffer. For inmiunodepletion, the 
packed protein A beads were incubated with 10 |ul of H9 cell extract, 45 nl of transcription 

20 buffer and 7.5 |.tl of 50 mM MgCl2 on a rotator at 4 for 1 hour. The beads were then 
pelleted by brief centrifugation and the supernatant was tested for transcription activity. The 
efficiency of inimunodepletion was determined by subjecting the depleted and mock-depleted 
extracts to SDS-PAGE and western blotting and was found to be greater than 95 %. The blot 
was probed with antibody to LEF-1 (diluted 1:2500, kindly provided by K. Jones, Salk 

25 Institute, La Jolla, California) and detected by enhanced chemiluminescence (Amersham ECL 
kit). Kodak Bio-Max film was used for detection. 

Polyaniide 3 inlnbits HlV-1 transcription in this system (lanes 2-5) with a 50% 
reduction of transcription observed at 10-30 nM polyaniide. Polyaniide 3 fails to inhibit HTV-l 
30 transcription in the LEF-1 -depleted extract (lanes 7-10). Tlie activity of the CMV MIEP was 
observed in both the mock-depleted and LEF-l depleted H9 cell extract, with the result that 
LEF-1 depletion had no effect on CMV transcription. The effect of the mismatch polyamide 4 
on HIV-1 transcription (Figure 21D, lanes 4 and 5) and polyamides 3 and 4 on CMV 
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transcription (Figure 21 D, lanes 6-10) were examined as additional controls. No potential 
binding sites for either polyamide 3 or 4 are present in the CMV MIEP sequence. As expected, 
polyaniide 3 fails to inhibit CMV transcription (Figure 21D, lanes 7-8). The mismatch 
polyamide 4 fails to inhibit either HIV-1 or CMV transcription (lanes 4-5 and 9-10, 
5 respectively). 

In Figure 21 A the inhibition of basal transcription with polyamide 1 is demonstrated. DNA 
templates containing tlic HIV-1 promoter (lanes 1-4, 9-12) and the CMV major immediate early 
promoter (lanes 5-8, 13-16) were incubated with the following concentrations of polyamide 1 

10 (lanes 2-4, 6-8) or polyamide 2 (10-12, 14-16) prior to the addition of a HeLa cell nuclear extract 
(16): no polyamide, lanes 1, 5, 9, 13; 50 nM polyamide, lanes 2, 6, 10, 14; 100 nM, lanes 3, 7, 11, 
15; 200 nM, lanes 4, 8. 12, 16. In Figure 21B the relative levels of HIV-1 transcription/CMV 
transcription are plotted as a fimction of polyamide 1 concentration. Data were obtained fiom 
mixed template reactions containing both DNA templates as described in Figure 21A. In Figure 

15 21C the inhibition of polyamide 3 on LEF-1 -activated transcription is shown. Transcription 
reactions were performed with EcoRI-digested plasmid pHIV LTR-CAT and the H9 whole cell 
extract (2 ^1) supplemented with 1 nl of the HeLa nuclear extract (lanes 1-5) or with 2 nl of the 
LEF-depletcd H9 extract and the HeLa extract (lanes 6-10). Plasmid DNA was incubated with 
polyamide 3 for 15 min prior to addition of cell extracts and other reaction components. The final 

20 concentrations of polyamide in the reaction were: no polyamide (lanes 1 and 6), 10 nM (lanes 2 
and 7), 30 nM (lanes 3 and 8), 100 nM (lanes 4 and 9) and 300 nM (lanes 5 and 10). In Figure 21 
d, transcription with mismatch control polyamide 4 and a control CMV template. Plasmid DNA 
was incubated with polyamide 3 (10 nM, lanes 2 and 7; 100 iiM, lanes 3 and 8) or polyamide 4 (10 
nM, lanes 4 and 9; 100 nM, lanes 5 and 10) for 15 min at ambient temperature prior to addition of 

25 cell extracts and other reaction components. 

Notably, polyamide 3 does not inhibit basal transcription with the HeLa nuclear extract 
although binding sites ibr this polyamide are present at tlie start-site for transcription and 
within the HIV- 1 mRN A coding sequence present in plasmid pHIV-CAT. These observations 
30 suggest tliai RNA polymerase II can transcribe DNA with a polyamide bound in the minor 
groove and that polyaniides are only inhibitory to transcription when these compounds 
interfere with the DNA binding activity of a required transcription factor. 
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Example 7: 
lubibition of RNA Pol II Transcription in 
Human Cells by Synthetic DNA-Binding Ligands 

5 The polyamides of tlie present invention have been demonstrated to inhibit HIV-1 

transcription in virus-infected human cells. In general, M. E. Klotman, S, Kim, A. Buchbinder, 
A. DeRossi, D. Baltimore, F. Wong-Staal describe the kinetics of expression of multiply 
spliced RNA in early HIV-1 infection of lymphocytes and monocytes/*roc. Natl Acad, ScL 
U.S.A. 88, 5011-5015 (1991). Since there are multiple spliced and unspliced species of HIV-1 

10 RNA with different turn-over kinetics, viral transcription was not monitired by measuring 
levels of RNA. Instead, the effects of the polyamides on the levels and kinetics of HIV-1 
replication in isolated human peripheral blood mononuclear cells (PBMC) was observed in 
culture. PBMC were infected with the T cell-tropic HIV-1 strain WEAU1.6, or the 
macrophage-tropic strain SF162. S. J. Clark et al describe high titers of cytopathic virus in 

15 plasma of patients with symptoms of primary human imunodeficiency virus type-1 infection. 
K Engl. J. Med. 324, 954 (1991); P. Borrow, et al., describe antiviral pressiu-e exerted by 
HIV-1 specific cytotoxic T-lymphocytes (CTLS) during primary infection demonstrated by 
rapid selection ofCTL escape virus Nature Med 3, 205 (1997); C. Cheng-Mayer, D. Seto, M. 
Tateno, J. A. Levy describe biologic features of HIV-1 that correlate with virulence in the 

20 hosLScience 240, 80 (1988); C. Cheng-Mayer, C. Weiss, D. Seto, J. A. Levy, Describe isolates 
of HIV-1 from the brain may not constitute a special group of the AIDS virus. Proc. Natl. 
Acad. ScL USA 86, 8575 (1989). Polyamides were added to the culture medium and the levels 
of HIV-1 p24 viral capsid protein in the culture media were determined on subsequent days 
using a standard ELISA assay. 

25 

Human PBMC were separated from whole blood collected from normal adult 
volunteers by density gradient centrifugation as described. D. E. Mosier, et al. describe HTV-l 
infection of human-PBL-SCID mice Science 251, 791 (1991); D. Mosier, R. Gulizia, P. 
Maclsaac, B. Torbett, J. Levy, describe rapid loss of CD4+ T-cells in human-PBL-SCID mice 
30 by noncytopathic HIV isolates Science. 260, 689 (1993). Donors were provided by the General 
Clinical Research Center of The Scripps Research Institute, which is supported by NIH grant 
MOl RR00833. Human PBMC were activated with 2 ng/mL PHA and 20 units/mL of IL-2 

for 2-3 days prior to HlV-1 infection. Each cultiu-e of 5 x 10^ PBMC was infected with 10^ 
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tissue culture infectious doses of HIV- 1 for 24 hours. Free virus was removed by washing the 
cells in medium, and polyaniides added to the culture. In Uie experiment shown in Figure 22C, 
polyamides were added 24 hours prior to virus exposure, and were continuously present 
thereafter. Vims replication in culture was measured by HIV-1 p24 viral capsid antigen 
5 ELISA (DuPont Medical Products, Boston, MA). 



Assays of HIV-1 replication representative of five replicate experiments with five 
human PBL donors are shown in Figures 22 and 23, illustrating resuhs firom two experiments 
employing different donors. Each experiment is controlled by using polyamides which differ 
10 by single atomic substitution firom the inhibitory polyamides. In contiol PBMC cultures with 
no added polyaniide, vij al replication resulted in increasing p24 levels between day 4 and day 
10 of culture (Figure 23 ). Addition of mismatch control polyamides 2 and 4 had no effect on 
the level of vims in tlie medium, either alone (Figure 22, A, B, & D) or in combination (Figure 
22C). 

15 

In contrast, both polyamides 1 and 3 added to the culture medium reduced p24 levels in 
a dose-dependent maimer (Figure 22A & B). Polyamide 1 at 1 ^iM concentration caused an 
80% reduction in vims (Figure 22 A), while polyamide 3 at 10 jiM concentration caused a 60% 
reduction (Figure 22B). These inhibitory effects were clear at 6 or 8 days of culture, but 
20 became less pronounced at later times. The two polyamide ligands, when used individually, 
delay tlie appearance of virus rather than absolutely inhibit virus production. 

The combination of polyamides 1 and 3 at 1 jiM each can act in synergy to reduce viral 
p24 levels to below the threshold of detection (<10 pg/ml; greater than 99.9% inhibition of 
25 viral rephcation) (Figure 22D, 23A & C), and were as effective as 1 ^iM azidothymidine (AZT) 
in blockmg HIV-1 replication (Figure 22D). The macrophage-tropic SF162 isolate, which 

repUcatcs in both macrophages and CD4"^ T lymphocytes, v/as more difficult to inhibit with 
single polyamides, but I he combination of 1 fiM polyamides 1 and 3 was able to reduce and 
eventually block its replication (Figure 23 A). These results demonstrate that cell-permeable 
30 synthetic DNA ligands can effectively inhibit HIV-1 rephcation in isolated himian 
lymphocytes in culture and suggest that inhibition of vims replication is due to inhibition of 
transcription factor-DNA interactions and gene transcription of viral and possibly cellular 
genes by RNA polymerase n. The observed polyamide inhibition of vims replication is likely 
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due to interference with the DNA binding activities of TBP and Ets-1 by polyamide 1 and the 
binding activity of LEF-1 by polyamide 3. The inhibitory effects of polyamides singly or in 
combination was not due to obvious toxicity. No significant decrease in cell viability was 
apparent in PBMC cultures treated with polyamides 1 and 3 for 10 days, in contrast to 90% 
5 mortality observed for PBMC cells treated with 1 ^iM AZT for the sanie period (Figure 23B & 
D). Cell viability was not impacted by polyamide treatment, but was reduced by AZT 
treatment. 

The combination of polyamides 1 + 3 inhibited HIV-1 replication, but the closely 
10 related polyamides 2 + 4 did not. The simplest explanation for this finding is that polyamides 
1 + 3 inhibited HIV-1 RNA transcription in cells as well as in the in vitro assays (Figure 22), 
but it is possible that inhibition of cellular genes involved in T cell activation could result in an 
indirect effect on HIV-1 replication. To further assess this possibility, a sensitive RNAase 
protection assay was performed for transcripts of a number of cytokine genes, including IL-2, 
15 IL-5, and IL-13 whicli differ by only single base mismatches J&om the target sequences 
flanking the TATA box in the HIV-1 LTR. Four other cytokine genes that lack binding sites 
for eitlier polyamide 1 or 3 in their promoters were also examined (Figure 24A). The results 
(Figure 24B) show thiit exposure of activated human PBMC to a combination of either 
polyamides 1 + 3 or 2 + 4 (1 \iM each) for 6 days failed to inhibit cytokine RNA expression. 
20 This lack of inliibition of cytokine gene transcription suggests that the polyamides reduce virus 
replication in cells by a direct effect on HIV-1 RNA transcription. 

Our present studies have utilized polyamides designed to target DNA sequences 6-7 bp 
in length and have shown that these compounds are effective inhibitors of gene transcription in 

25 cell-free systems and, viila infra, in human cells. Because sequences of these lengths would be 
highly redundant in the human genome, it had seemed likely that these ligands would have 
deleterious effects on cell metabolism due to interference witli the activity of cellular genes. 
However, the results described here indicate that a battery of polyamides which recognize 6-7 
bp sequences will be sufficient for gene-specific regulation in vivo. It is interesting to compare 

30 these small molecule transcription factors to eukaryotic transcriptional regulatory proteins 
which also recognize multiple sequences of similar length in tandem in order to increase 
fimctional specificity. Trie observations that polyamides do not interfere with pol II elongation, 
and that polyamides can bind simultaneously with certain major groove binding proteins 
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should further enhance gene-specificity. M. G. Oakley, M. Mrksich, P. B. Dervan describe 
simuhaneous binding of polyamide in the minor groove with a protein in the major groove. 
Biochemist/y 31, 10969 (1992). 



5 As shov^n in Figure 16, polyamides are not limited to 6-7 base-pair recognition, but can 

bind as cooperative dimers to sequences up to sixteen base pairs in length. Trauger, LW,, 
Baird, E. E. Mrksich, M., Dervan, P.B. describes the recognition of 9-13 base pairs of DNA by 
a p-alaiiine linked extended polyamide dimer. 

10 The specific inliibition of genes transcribed by RNA polymerase 11 represents an 

important first step toward asking whether cell-permeable small molecule transcription 
antagonists might regulate gene expression in complex organisms. TBP and sequences 
adjacent to the TATA element for inhibition of basal transcription by RNA polynierase 11 were 
chosen. Since most tissue-specific cellular genes and viral genes contain TATA elements, this 

15 approach is generally applicable for the inhibition of most tai get genes. 

The inliibition of HIV-1 replication in peripheral blood mononuclear cells by 
polyamides is shown in Figure 22. Panels A-D depict three separate experiments in which 
polyamides alone (A, B) or in combination (C, D) were added to cultures of human peripheral 

20 blood mononuclear cells (PBMC) stimulated 3 days earlier with phytohemagglutinin and 
interleukin-2. PBMC cultures were infected with the primary HIV-1 isolate WEAU 1.6 
(kmdly provided by G. Shaw), and virus replication measured by release of p24 capsid antigen 
into the medium that was detected by antibody-capture ELISA (Dupont). Each experiment 
involved a separate PBMC donor. Values shown are for day 6 or day 8 after virus infection. 

25 When two polyamides were combined, the concentration shown is for each component of the 
mixture. In panel D, the combination of 1 ^M polyamide 1 + 1 |iM polyamide 3 cooperatively 
blocked virus infection and p24 release (below 10 pg/ml, the detection limit of the assay), as 
did the addition of 1 |iM azidothymidine (AZT). Assays of p24 concentration were performed 
in duplicate and showed less than 5% variation from the mean value reported. 

30 

Figure 23 shows the kinetics of the inhibition of HIV-1 replication and effects on cell 
viability by a combination of polyamides added to PBMC cultures. PBMC were isolated as in 
Figure 22 and infected either with the macrophage-tropic HIV-1 isolate SF162 (panels A and 
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B) or the T-cell tropic isolate WEAU 1.6 (panels C and D). Polyamides 1 and 3 were added in 
combination at 1 each. Although each polyamide alone was not very effective in blocking 
replication of HIV- 1 SF162 (data not shown), the combination of both polyamides was able to 
stabilize and later reduce virus p24 production to undetectable levels (panel A, filled triangles) 
5 while untreated cultures (open triangles) continued viral replication. Cell viability was 
determined at days 4 and 10 of culture by trypan blue exclusion. 

The combination of polyamides 1 and 3 caused a small decrease in viability compared 
to cultures that were untreated (panel B, filled versus open triangles), but parallel cultures 

10 treated with 1 jiM azidolhymidine (AZT) showed a much more severe decline in cell viability. 
AZT completely inhibited virus replication of both HIV-1 SF162 and WEAU 1.6 at this 
concentration (data not shown). In contrast to these results, each polyamide alone partially 
inhibited replication of HIV-1 WEAU 1.6 (see Figure 23 A & B), and the combination of 
polyamides 1 and 3 blocked replication at all times examined (panel C, filled circles) compared 

15 to untreated cultures (open circles). Cell viability was higher in cultures treated with 
polyamides 1 and 3 than in untreated cultures (panel D, filled circles versus open circles), 
probably because the cytopathic effect of HIV-1 infection was completely reversed. AZT 
again showed substantial toxicity for cells. Assays of p24 concentration were performed in 
duplicate and showed less than 5% variation from the mean value reported. Similar results 

20 were obtained in two additional experiments. 



A lack of inhibition of cytokine gene expression with polyamides targeted to the HIV-1 
promoter and enhancer sequence can be seen in Figure 24. In Figure 24A the sequences of 
the TATA-box region (taken fi*om GenBank listings) of each of the cytokine/growth factor 

25 genes examined are shown with the TATA box in bold and the binding site for polyamide 1 
underlined. Single base mismatches are indicated in lower case. In Figure 24B, hxraian 
peripheral blood mononuclear cells were cultured under the same conditions used for HIV-1 
infection. Cultures were either left untreated, or 10 p.M of polyamides 1 + 3 or polyamides 2 + 
4 added. After six days, cells were harvested, total RNA extracted, and a ribonuclease 

30 protection assay performed with riboprobes specific for the indicated cytokines as well as for 
CD4, CDS and ribosomiil protein L32. M. V. Hobbs, et al., describe patterns of cytokme gene- 
expression by CD4+ T-cells fiom young and old mice 7. Immunol 150, 3602 (1993). After 
digestion with RNAse Tl, protected fragments were sepai*ated by polyacrylamide gel 
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electrophoresis, and tlie amount of labeled probe quantitated by Phosphorlmager analysis. 
Data are expressed as ihe intensity of each cytokine RNA relative to the intensity of the 
ribosomal L32 RNA band to standardize for RNA loading. There was no difference in the 
intensity of CD4 and CDS RNA bands between groups, indicating equivalent recovery of CD4 
5 and CDS T cells in all cultures. Similar results were seen when the cultured cells were 
analyzed after 10 days, although the levels of RNA for most cytokines had declined in both 
polyamide treated and untreated cells. 



10 

Example 8: 
Differential Inhibition of Ets-1 
Binding With Two Polyamides 

Ets-1 is another cellular transcription factor required for higli levels of HIV-1 RNA 
15 synthesis (Figure 25). Figure 25 shows the schematic structures of polyamides targeted to the 
Ets-l/LEF-1 region of the HIV-1 enhancer and the sequence of the binding sites for these 
polyamides. Polyamides 1 and 2 have been described above in the context of the LEF-1 
inhibition experiments above . Polyamide 3 (ImPy-p-ImPy-y-ImPy-p-ImPy-p-Dp) binds the 
sequence 5'-TGCTGCA-3' with a Kd of 0.05 nM, while the mismatch polyamide 4 (Imlm-p- 
20 Imlm-y-PyPy-P-PyPy-P-Dp) exhibits 100 fold lower affinity for binding that site. The binding 
sites for polyamides 1 and 3 are located immediately downstream (polyamide 1) and upstream 
(polyamide 3) of the Els-1 DNA recognition sequence in the HIV-1 enhancer region (Figure 
25B). Polyamide 3 also recognizes sequences immediately flanking the TATA box of the 
HIV-1 promoter, and was shown to inhibit DNA-bindmg of TBP and prevent basal 
25 transcription from this promoter. 

Schematic representation of the HIV-l enhancer/promoter region with the DNA- 
binding sites for the transcription factors USF, Ets-1, LEF-1, NF-kappaB, Spl and the TATA- 
binding protein TBP, The positions of the polyamide binding sites are indicated by vertical 
30 arrows. The DNA-binding sites for polyamides 1 and 3 flanking the Ets-1 recognition site are 
shown in detail below. The Ets-1 binding site is boxed, the GGA core recognition sequence, 
that is conserved in most Ets-protein binding sites, is indicated by a small rectangle. The LEF-1 
binding site is underlined. 
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Since the Ets-1 binding site is flanked by and partially overlaps with both polyamide 1 
and polyamide 3 binding sites, whether Ets-1 binding could be inhibited by either one, or both 
of these two polyamides was tested by gel mobiUty shift experiments. When polyamides were 

5 preincubated with a radiolabeled double-stranded HIV oligonucleotide before adding Ets-1 
AN331, polyamide 1 had no effect on Ets-1 DNA-binding, even at a concentration as high as 
400 nM (Figure 26 lanes 3-5). Polyamide 3, however, prevented the Ets-l/DNA complex 
formation (Figiire 26, lanes 9-11). When both polyamides were combined, the degree of 
inhibition was very similar to the one observed with polyamide 3 alone (Figure 26, lanes 15- 

10 17). Tlie two mismatch polyamides 2 and 4 did not prevent complex formation, either alone or 
combined (Figure 26, lanes 6-8, 12-14 and 18-20). A titration experiment revealed that 
polyamide 3 inhibited Ets-1 AN331/DNA complex formation by 50% at a concentration of 
approximately 6 nM, nearly complete inhibition was achieved between 50 and 200 nM 
polyamide. Mismatch polyamide 4 had virtually no effect in the same concentration range» 

15 

A double-stranded, labeled oligonucleotide coiTcsponding to the Ets-1 binding site with 
flanking regions within the HIV-l enhancer/promoter was used as a probe in gel mobility shift 
assays. Polyamides were preincubated with the DNA before addition of Ets-1, where indicated. 
The concentrations of polyamides were 100 nM (lanes 3, 6, 9, 12, 15 and 18), 200 nM (lanes 4, 

20 7, 10, 13, 16 and 19), and 400 nM (lanes 5, 8, 11, 14, 17 and 20). AN331 protein was added at 
a concentration of 12 nM. Positions of the free probe (F) and bound probe (B) are indicated. 
Figure 27A shows a representation of an autoradiogi am of a representative gel mobility shift 
assay showing the inhibitory effect of polyamide 3 on AN331 binding. The concentration of 
AN331 was constant (12 nM in lanes 2-10), the probe concentration was 50 pM. The 

25 polyamide was preincubated for 20 min at room temperature before addition of the protein, 
followed by an additional 20-30 min incubation on ice. Tlie polyamide concentrations were 
1.56, 3.125, 6.25, 12.5, 25, 50, 100, 200 nM in lanes 3 to 10, respectively. Figure 27B is a 
graphical representation of the decrease of the fraction of bound probe as a function of 
polyamide concentration. Closed squares represent the data points obtained for polyamide 3, 

30 open squares represent mismatch polyamide 4. Figure 27C is a graphical representation of 
the effect of polyamide 3 on Ets-1 binding when added at different time points during the 
binding reaction. Polyamide 3 was added to the probe either prior to addition of Ets-1 (closed 
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squares), or after addition of Ets-1 (open squares), or simultaneously with Ets-1 (closed 
triangles). 

Figure 28 is a representation of the results of DNase I footprint titration experiment 
5 with polyamides 1 and 3 in the absence (-) or presence (+) of 9.6 nM AN331 protein. 
Polyamides were incubated with the radiolabeled probe prior to addition of Ets-1. The 
polyaniide concentrations were 0 nM (lanes 2, 6, 10, 14), 4 nM (lanes 3, 7, 11, 15), 20 nM 
(lanes 4. 8, 12, 16) and 100 nM (lanes 5, 9, 13, 17). Lanes 1 and 18 show a G+A sequencing 
ladder. The regions protected by polyamides and by Ets-1 are indicated alongside the 
10 sequencing ladder. Note the two DNase I hypersensitive sites characteristic of the Ets-1 
footprint. 

Folyamide 3 prevents Ets-1 DNA-binding while polyamide 1 coexists with Ets-1 on 
overlapping DNA binding sites: 

15 

A labeled DNA fragment derived from the HIV-1 enhancer was incubated with each 
polyamide either alone, or followed by addition of Ets-1 and a further 30 minute incubation. As 
expected, both polyamides bound their target site with similar affinities, polyamide 1 even 
bound with a slightly higher affinity than polyamide 3, complete protection was observed with 

20 20 nM polyamide 1, and with 100 nM polyamide 3 (Figure 28, lanes 3-5 and 1 1-13). The Ets-1 
footprint is characterized by two DNase I hypersensitive bands appearing in the center and at 
the 5' boundary of the footprint. These hypersensitive sites disappear with the addition of 20 to 
100 iiM polyamide 3 (Figure 28, lanes 7-9), but remain unchanged with the addition of 
polyamide 1 (Figure 2S, lanes 15-17). The simultaneous presence of polyamide 1 and Ets-1 

25 results in a broadening of the footprint, corresponding to a combined footprint created by Ets-1 
and polyamide 1 (Figure 28, lanes 14-17), while the Ets-1 footprint is replaced by the 
polyamide 3 footprint (lanes 7-9). Based on the structure of the Ets family protein PU.l in 
complex with its DNA recognition element, it has been proposed that DNA recognition is 
mediated by both major groove and minor groove contacts. In the case of Ets-1, mmor groove 

30 contacts would be expected to occur at the 5' end of the recognition element. This structural 
prediction is in complete agreement with our polyamide inhibition experiments. A polyamide 
located at the 5' end of the Ets-1 site is inhibitory to DNA binding whereas a polyamide 
located at the 3' end of the Ets-1 site is not inhibitory to DNA binding. 
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Example 9: 

Inhibition of TBP Binding and Basal Transcription with Polyamides Designed to Bind 
Sequences Adjacent to the TATA Element 

5 

The previous Examples demonstrate that polyamides which bind sequences adjacent to 
the well-conserved TATA sequence upstream from the transcription stai1-site of messenger 
RNA-coding genes inhibit TBP binding and basal transcription by RNA polymerase 11. This 
method is a useful general approach for modulation of gene expression. For example, specific 

10 polyamides have been designed and synthesized that bind to double stranded DNA adjacent to 
the TATA element of the human cytomegalovirus major immediate early promoter (CMV 
MIEP). Polyamide CMV-1, ImlmPyPy-y-ImPyPyPy-P-Dp, binds the identified target 
sequence 5'-AGGTCT-3\ where the last T of this sequence is the 5' T of the TATA element. 
Polyamide CMV-1 binds the CMV MIEP with an apparent dissociation constant of 1 nM and 

15 is an effective inhibitor of TBP binding and basal transcription. Appropriate mismatch 
polyamides do not inhibit either TBP binding or transcription. 

In another example, a polyamide was designed to bind immediately downstream of the 
TATA element found in the human Her-2/neu breast cancer oncogene promoter. This 

20 polyamide, Her2-1, of composition ImPy-p-Pylm-y-PyPy-P-PyPy-P-Dp, binds the sequence 
5'AGAATGA-3' (where the 5' A of this sequence is the 3' A of the TATA element) with an 
apparent dissociation constant of 20 pM and is an effective inhibitor of TBP binding and 
transcription. Her-2/neu is recognized to be over-expressed in several cancers, including 
human gynecologic adenocarcinomas, including those of the ovary, endornetrium, breast, 

25 fallopian tube and cervix (Cirisano, F.D., & Karlan, B.Y., J. Soc. Gynecol Investig. 3 99-105 
(1996)). 

These additional data confirm the generality of this approach and demonstrate that 
polyamides located either upstieam or downstream of the TATA element are effective 
30 transcription inhibitors. Table 3, below, lists the gene promoters, TATA sequences and 
composition of the polyamides which have been shown to be inliibitors of TBP binding and 
transcription. 
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Tables 

Polyamide Inhibition of TBP Binding and Basal Transcription by Targeting 
Sequences Adjacent to the TATA Element 


GENE 


TATA SEQUENCE 


POLYAMIDE 


Artificial 
TATA 


5'-TATAAGTAeTT-3' 


ImPyPyPy-Y-ImPyPyPy-P-Dp 


HIV-1 
Promoter 


5*-TGCTGCATATAA-3' 
5*-TATAAGCAGCT-3' 


ImPy-P-Pylm-Y-PyPy-p-PyPy-p-Dp 


CMV 
MffiP 


5'-AGGTCTATAA-3' 


ImlmPyPy-y-ImPyPyPy-p-Dp 


Her-2/neu 
oncogene 
(Breast Cancer) 


5'-TATAAGAATGA-3 ' 


ImPy-p-Pylm-Y-PyPy-P-PyPy-p-Dp 



Example 10: 

Anti-repression of Polymerase II Transcription by a Designed Ligand: 

5 

Pyrrole-imidazole polyamides can exert a positive effect on transcription by interfering 
with the activity of a specific repressor protein. The human cytomegalovirus (HCMV) IE86 
repressor protein is well suited for this study, because transcriptional repression is dependent 
on IE86 binding to its DNA target site. IE86 negatively regulates the major immediate early 

10 promoter (MIEP) of HCMV by binding to a sequence element (the cis repression signal, crs) 
located between the TATA box and the start of transcription. It was shown that IE86 exerts its 
negative effect on transcription by binding to the crs element, thereby blocking recruitment of 
RNA polymerase 11. The polyamides that specifically recognize the crs, prevent IE86 firom 
binding and relieve transcriptional repression, while a mismatch polyamide or a polyamide 

15 which binds to a nearby site, have no effect. Furthermore, occupancy of the crs element by a 
small polyamide is not sufBcimt to repress transcription, supporting the model that IE86- 
mediated repression results from steric interference with the recruitment of RNA polymerase 
n to the preinitiation complex. 

20 Figure 29 is a schematic representation of the sequence of the HCMV MIEP region 

from position -34 to +7. The TATA box and the repressor bmding site (located at -14 to +1 
relative to the start site of transcription) are boxed. The polyamides are schematically 
represented at their respective DNA binding sites. The shaded and open circles represent 
imidazole and pyrrole rings, respectively, the hairpin jimction is formed by y-aminobutyric 
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acid, and the diamond represents P-alanine. Polyamides 2 and 3 contain an additional amino 
group at the hairpin jxmction. The mismatch in polyamide 4 is boxed. Structures of five 
polyamides are shown schematically: ImPylmPy-y-PyPyPyPy-P-Dp (1), ImPyImPy-2,4D- 
PyPyPyPy-p-Dp (2), Im-p-ImPy-2,4D.PyPyPyPy.p-Dp (3), ImPyPyPy-y-PyPyPyPy-p-Dp 
(4) and ImlmPyPy-y-ImPyPyPy-p-Dp (5), which binds the sequence 5'-AGGTCT-3' adjacent 
to the TATA box. Im = imidazole, Py = pyrrole, y = y-aminobutyric acid, p = p-alanine, 2,4D 
= 2,4 diaminobutyric acid, and Dp =dimethylaminopropylamide. 

The eight ring polyamide 1, LnPylmPy-y-PyPyPyPy-p-Dp, binds the six base pair 
sequence 5 -AGTGAA-3* within the IE86 binding site with an apparent dissociation constant 
of 1 .4 nM. The predicted structure of this complex is shown in Figure 29 (polyamide 1). Two 
additional polyamides were synthesized that bind the same site, but with higher affinities: 
polyamide ImPylmPy-D-PyPyPyPy-P-Dp (polyamide 2, where 2,4D represents 2,4 
diaminobutyric acid) and Im-P-IniPy-2,4D-PyPyPyPy-p-Dp (polyamide 3) bind the sequence 
5*-AGTGAA-3' with KdS of 1 nM and 0.25 nM, respectively. A single atom substitution in 
this molecule, changing an imidazole to a pyrrole (nitrogen to C-H; polyamide 4. shown in 
Figure 29), reduces the affinity of this compound by -30 to 100-fold for its target site. A 
control polyamide, ImlmPyPy-y-ImPyPyPy-P-Dp (polyamide 5) binds the sequence 5'- 
AGGTCT-3' adjacent to the TATA box of the MIEP with a Ka of 1 nM. 

These compounds together with purified recombinant IE86 protein were used in DNase 
I footprint experiments with a singly end-labeled restriction fi*agment derived fiom the CMV 
major immediate early promoter. As predicted, polyamide 1 protects the sequence 5'- 
AGTGAA-3' in the center of the IE86 binding site. Approximately 50% protection is seen 
with 2 nM polyamide, and complete protection is obtained with 200 nM polyamide 1 (Figure 
30, lanes 3-5). Recombinant protein IE86 protects a region extending between positions -24 
and +3. Approximately 250 nM IE86 is required to give complete protection (Figure 30, lanes 
6 and 14). When polyamide 1 is preincubated with DNA for 15 minutes prior to addition of 
protein, the IE86 footprint is partially replaced by the polyamide 1 specific footprint at a 
concentration of 2 nM polyamide, and is completely replaced at a concentration of 200 nM 
polyamide (Figure 30, lanes 7-9). The control polyamide 5, which protects a site 5*-AGGTCT- 
3' immediately upstream of the TATA box, does not interfere with IE86 binding in a 
concentration range of 2-200 nM. Both the IE86-specific footprint and the polyamide 5 



WO9835702 [fi!e:/A\cadmrfs01\fifmdata\lp\FolevPat\PatentDocuments\WO9835702.cpc] 



WO 98/35702 



PCT/US98/02444 



Page 65 of 113 



-63- 

footprint coexist, even though they are only 9 base-pairs apart (Figure 30, lanes 14-17). A 
singly-end labeled restriction fragment was used in DNase I footprinting reactions. Reactions 
contained approximately 250 nM his-tagged E86 where indicated, and polyamide 1 in the 
concentrations indicated at the top of the lanes. Lanes 1 and 18 represent a G+A sequencing 
5 ladder. The extent of the footprints created by polyamide 1, IE86 and the control polyamide 5 
are indicated by brackets alongside the autoradiogram. The location of the TATA box and the 
start of transcription are indicated. 

The same experiment was repeated with polyamides 2 and 3, which bind the same site 
10 5*-AGTGAA-3' with higher affinities than polyamide 1. A complete protection is seen with 5 
and 10 nM polyamide 3 (Figure 31, lane 4), which ate also the concentrations required to 
completely inhibit IE86 from binding (Figure 31, lanes 8 and 9). A partial inhibition of IE86 
binding is detected at 1 nM polyamide 3 (Figure 31, lane 7). A similar resuU was obtained 
with polyamide 2 (data not shown). The mismatch polyamide 4 did not give a footprint on the 
15 CMV fragment in the concentration range used in these experiments (1 - 200 nM), and it did 
not interfere with IE86 binding (Figure 31, lanes 11-13 and 15-17). The IE86 concentration 
was approximately 250 nM. The concentrations of the polyamide are indicated in nM at the top 
of the lanes. The extent of the footprints for polyamide 3 and IE86 protein as well as the TATA 
box and transcription start site are shown at the side of the autoradiogram. 

20 

The effects of the polyamides on MIEP transcription were tested in an in vitro system 
consisting of a cell-free extract prepared from cultured human CEM cells. It is known from 
previous studies that IE86 inhibits transcription by binding to the cis repression sequence, and 
physically blocks the access of RNA polymerase n to the preinitiation complex. Whether 
25 polyamide 1, which has a relatively low binding affinity (in about the same range as IE86), 
could counteract the negative effect of IE86 on transcription was tested. Rim-off transcription 
reactions were performed with linearized plasmid and CEM extract. The concentration of IE86 
that was needed to give approximately 75% inhibition of transcription was empirically 
determined. The polyamides were added at a concentration of 1 ^M. 

30 

Figure 32 shows that IE86 inhibits CMV transcription by approximately 75% (lane 2). 
When polyamide 1 is preincubated with DNA prior to addition of IE86 protein, the level of 
transcription is restored to approximately 75% of the control (Figure 32, lane 3). Interestingly, 
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the presence of polyamide 1 bound to the crs in the absence of IE86, has no effect on 
transcription, even at a concentration of 1 ^iM (Figure 32, lane 4). An unrelated polyamide, 
which has no binding site in the CMV promoter, has no effect on IE86-mediated repression, 
even at a concentration of 1 (Figure 32, lane 5). Thus, polyamide 1 is able to specifically 
5 counteract transcriptional repression mediated by IE86 protein. 

The foregoing is intended to be illustrative of the present invention, but not limiting. 
Numerous variations and modifications of the present invention may be effected without 
departing ftom the true spirit and scope of the invention. 

10 
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1 . A method of modulating the expression of a cellular or viral gene comprising the steps 
of: 

a. identifying a unique target DNA sequence adjacent to the binding site of at least one 
minor groove transcription factor protein; 

b. chosing a polyamide having a subnanomolar affmity for the unique target DNA 
sequence; and 

c. contacting the unique target DNA sequence with a transcription inhibiting amount of 
the polyamide. 

2. The method of claim 1 comprising the additional step of designing the polyamide to be 
specific for the unique target DNA sequence. 

3. The method of claim 1 wherein the minor groove transcription factor protein is chosen 
from the group consisting of TFIHA, TBP, LEF-1, and Ets-1. 

4. A method for modulating expression of a cellular or viral gene controlled by a minor 
groove transcription factor protein using a specifc polyamide including paired 
carboxamide residues, comprising the steps of: 

a. identifymg a target sequence of double stranded DNA adjacent to the binding site 
of the minor groove transcription factor protein, the target sequence having the form 
5'-WNiN2 . , . NniW-3', wherein N1N2 . . . Nm is the sequence to be bound by 
carboxamide residues, wherein each N is independently chosen from the group A, 
G, C, and T, each W is independently chosen from the group A and T, and m is an 
integer having a value from 3 to 6; 

b. selecting a specific polyamide that is selective for identified target DNA sequences 
5'-WNiN2 , . . NniW-3' and having the form 

X1X2 . . . Xin-Y-X(m + 1) . . . X(2m-l)X2m 

c. wherein Xi, X2, Xm, X(in + 1), X(2m - 1)» and X2m are carboxamide residues 
forming carboxamide binding pairs Xi/X2m, X2/X(2m-1)> ^mP^{m + 1)> and y is 
y-aminobuytic acid or 2,4 diaminobutyric acid and Dp is 
dimethylaminopropylamide, where m is an integer having a value from 3 to 6; and 

d. contacting the target sequence with a transcription inhibiting amount of the specific 
polyamide. 
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5. The method of claim 4 further cx)mprising the step of designing the specific polyamide 
of the form 

X1X2 . . . Xm-Y-X(in + 1) , . . X(2m-l)X2m. 
comprising the steps of: 

5 a* representing the identified target sequence as 5 *-WaA . • . jcW-3 wherein a is a first 

nucleotide to be bound by the Xi carboxamide residue, A is a second nucleotide to 
be bound by the X2 carboxamide residue, and x is the corresponding nucleotide to 
be boimd by the Xm carboxamide residue; 
b. defining a as A, G, C, or T to correspond to the first nucleotide to be bound by a 

10 carboxamide residue in the identified sequence; 

selecting Im as the Xi carboxamide residue and Py as the X2m carboxamide 
residue if a = G; 

d. selecting Py as the Xi carboxamide residue and Im as the X2m carboxamide 
residue if a = C; 

15 e. selecting Hp as the Xi carboxamide residue and Py as the X2m carboxamide 

residue if a = T; 

f- selecting Py as the Xi carboxamide residue and Hp as the X2m carboxamide 

residue if a = A; and 
g. repeating steps c - g for 6 through jc until all carboxamide residues are selected. 



20 6. The method of claim 4 jRirther comprising the step of replacing at least one pyrrole 
residue with a p-alanine residue. 

7. A method of inhibiting the replication of a pathogen by administering a transcription 
inhibiting amount of at least one polyamide compoxmd. 

8. The method of claim 7 wherein the pathogen is chosen from the group consisting of 
25 viruses, bacteria, fimgi and protozoans. 

9. The method of claim 7 wherein the pathogen is a retrovirus. 

10. The method of claim 7 wherein the pathogen is HTV-l . 

11. The method of claim 7 wherein the polyamide is chosen from the group consisting of 
ImPyPyPy-y-hnPyPyPy-p-Dp, hnPy-p-ImPy-y-ImPy-p-ImPy-p-Dp, ImPy-p- 

30 PyPyPyPy-P-ImPy-p-Dp, ImPy-P-linPy-P-PyPyPyPy-p-ImPy-P-ImPy-p-Dp, ImPy-p- 

ImPyPyPy-y-ImPyPyPy-p-PyPy-p-Dp, LnlmPyPy-Y-ImPyPyPy-p-Dp, and mixtures 
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thereof wherein Im is N-methylimidazole, Py is N-methylpyrrole, y is y-aminobutyric 
acid, P is P-alanine, and Dp is dimethylaminopropylamide. 

12. A composition comprising: 

a, a transcription inhibiting amount of at least one polyamide chosen from the group 
5 consisting of ImPyPyPy-y-ImPyPyPy-p-Dp, ImPy-p-ImPy-y-ImPy-p-ImPy-p-Dp, 

ImPy-p-PyPyPyPy-p-ImPy-p-Dp, ImPy-p-ImPy-P-PyPyPyPy-p-ImPy-P-ImPy-p- 
Dp, ImPy-P-ImPyPyPy-y-ImPyPyPy-P-PyPy-p-Dp, ImlmPyPy-y-ImPyPyPy-p-Dp, 
and mixtures thereof wherein Ln is N-methyUmidazole, Py is N-methylpyrrole, y is 
y-aminobutyric acid, p is P-alanine, and Dp is dimethylaminopropylamide; and 
10 b. a pharmaceutically acceptable excipient. 

13. The composition of claim 1 2 wherein the composition is suitable for parenteral use. 

14. A method of improving the binding affinity of a polyamide selective for an identified 
target DNA in a viral genome comprising the step of replacing a carboxamide binding 
pair that does not include N-methyl-imidazole carboxamide with a carboxamide 

15 binding pair consisting of P-alanine paired with P-alanine. 

15. A method of inhibiting the binding of the zinc finger protein TFUIA to the 5S 
ribosomal RNA gene internal control region, comprising: 

contacting the 5S ribosomal RNA gene intemal control region with an inhibiting 
amount of a polyamide of the formula ImPyPyPy-y-hnPyPy-P-Dp, wherein Im is N- 
20 methylimidazole, Py is N-methylpyrrole, y is y-aminobutyric acid, P is P-alanine, and 

Dp is dimethylaminopropylamide. 

16. A composition comprising a transcription inhibiting amount of at least one polyamide 
chosen from the group consisting of ImPyPyPy-y-ImPyPy-p-Dp, ImPy-P-ImPy-y- 
ImPy-P-ImPy-P-Dp and mixtures thereof and a pharmaceutically acceptable excipient 

25 suitable for the treatment of HIV- 1 infection. 

17. A method of treating a human patient with an HIV-1 infection comprising the step of 
administering a composition comprising a transcription inhibiting amount of at least 
one polyamide chosen from the group consisting of ImPyPyPy-y-ImPyPy-p-Dp, ImPy- 
p-ImPy-y-ImPy-p-ImPy-P-Dp and mixtures thereof and a pharmaceutically acceptable 

30 excipient. 

18. A method of treating HIV-l infected human blood cells in vitro comprising the step of 
administering a composition comprising a transcription inhibiting amoxmt of at least 
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one polyamide chosen from the group consisting of ImPyPyPy-y-ImPyPy-p-Dp, ImPy- 
P-ImPy-y-ImPy-p-ImPy-p-Dp and mixtures thereof. 

19. A composition comprising a transcription inhibiting amount of hnPy-p-ImPy-y-PyPy- 
P-PyPy-p-Dp and a phannaceutically acceptable excipient suitable for the treatment of 

5 a cancer charactertized by the over-expression of her-2/neu. 

20. A method of treatment of a adenocarcinoma of the ovary, endometrium, breast, 
fallopian tube and cervix comprising administering the composition of claim 19. 
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